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METHOD FOR IDENTIFYING PEPTIDES THAT AFFECT PROTEIN-PROTEIN INTERACTIONS AND COMPLEMENT- 
MODULATING PEPTIDES 

RELATED APPLICATION 

This application claims priority to provisional aoolication No. 60/000674. filed June 29. 1995 under 35 
5 U.S.C. § 119(e). 

GOVERNMENT RIGHTS 

This invention was made with United States government support under grants R01 GM29831 and R01 
GM36960 awarded by the National Institutes of Health. The government has certain rights in the invention. 

FIELD OF THE INVENTION 

10 The present invention relates to modulating protein activity, and specifically relates to a general method 

of identifying regions of contact in protein-protein binding complexes, peptides or peptide analogs that modulate these 
orotein-protein interactions, and peptides or peptide analogs that modulate complement activity. 

BACKGROUND OF THE INVENTION 
A fundamental component of essentially all biological processes is the specific interaction of proteins with 

15 other molecules, in particular other proteins. Specific protein-protein interactions are essential for cellular 
maintenance, regulation, reproduction and death. Protein-protein interactions are required for basic processes such 
as intercellular adhesion and communication, signal transduction, and gene replication, expression, and regulation. 
Specific protein-protein interactions are also critical in specialized processes such as blood clotting and the immune 
system, including the complement system. Thus, the ability to modulate specific protein-protein interactions is a key 

20 to modulating essentially all biological processes. 

Protein-protein binding interactions occur at specific regions of contact on the protein surfaces and 
identification of the amino acid residues in these contact regions is important for controlling the binding interaction. 
For example, knowledge of the residues known to be involved in a contact site may be used to synthesize a relatively 
short peptide corresponding to the protein segment that includes these residues. This "interface'* peptide may be 

25 used directly to compete with and thus inhibit that specific protein-protein binding interaction. Interactions that have 
been successfully inhibited in this manner include those between the herpes virus protein Vmw65 and the Oct-1 
protein (Haigh, A. et aL, 1990. Nature 344:257-259), the subunits of ribonucleotide reductases from viral and 
mammalian sources (Dutia. B.M. et aL, 1986, Nature 321:439-441; Cohen, LA. et aL 1986. Nature 321:441443; 
Cosentino, G. et at., 1991, Biochem. Cell. Biol. 69:79 83), the subunits of the protease and reverse transcriptase of 

30 human immunodeficiency virus (HIV) (Zhang, Z.-Y. et al., 1991, 1 Biol. Chem. 266:15591*15594; Babe. LM. et aL, 
1992, ProL ScL 1:1244-1253; Schramm, HJ. et al., 1993, Biochem. Biophys. Res. Common. 194:595-600; Dhrita, 
G. et aL, 1994, J. Biol. Chem. 269:13080-13083), and the catalytic and accessory subunit, UL42. of herpes virus 
DNA polymerase (Digard, P. et aL, 1995, Proc. Natl. Acad. Sci. USA 92:1456-1460). 

An interface peptide may also be used indirectly to elicit an antibody which binds specifically to the intact 

35 protein. Such an antibody may be used to inhibit the protein-protein binding reaction. An interface peptide may be 
used as a structural model for design of a modified peptide, a peptide analog (Bianchi. E. et aL, 1995. J. Mol. Biol. 
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247:154-601 or peptidomimenc inhibitor (McDonnell J.M. et aL, 1996, Nature Structural Biol. 3(5):4 19426; 
Nakanishi, H. et aL, 1993, Gene 1 37:51 56), as has been accomplished for a ribonucleotide reductase protein (Uuzzi, 
M. et aL, 1994, Nature 372:695-698). If the protein-protein interaction is between a receptor and its ligand, an 
interface peptide may inhibit or substitute for the normal receptorligand interaction, and hence attenuate or mimic 
5 the normal cellular response depending on the signalling mechanism of the ligand-receptor binding. 

Currently used methods for identifying residues at the protein-protein interface typically include direct 
analysis of the three-dimensional structure of the protein-protein complex (e.g., X-ray crystallography or NMR 
spectroscopy), or systematically testing the binding activity of protein fragments or natural or engineered variants 
of these proteins. In the studies listed above, where synthetic peptides were used to block protein-protein 

10 interactions, crystallography data and genetically engineered modified proteins were used to identify interface 
residues in all cases except the herpes virus ribonucleotide reductase; in that case, a peptide designed to elicit an 
antibody response was fortuitously found to inhibit subunit association as well (Dutia, B.M. et aL, 1986, Nature 
321:439441; Cohen, E.A. et al., 1986, Nature 321:441-4431 All of the currently used methods are laborious and 
relatively unpredictable with regard to identifying interface peptides successfully, especially for proteins larger than 

15 a few hundred amino acid residues in length. X-ray crystallography requires crystallization of the protein complex 
in a form that can be analyzed. NMR spectrometry is limited by the size of the complex that can be analyzed. 
Methods involving modified proteins and their fragments require extensive testing, especially for larger proteins and 
without guidelines for identifying likely regions of contact. 

The complement system, part of the mammalian humoral immune system, lyses microorganisms and infected 

20 cells by forming holes in the their plasma membranes. The complement system consists of more than twenty plasma 
and membrane-bound proteins which interact to trigger and modulate complement activity. While normal complement 
function is essential for health, regulation of the effects of complement activation is also important. For example, 
uncontrolled complement activation can lead to adverse symptoms such as continuous inflammatory reactions in a 
variety of diseases (Vogt, W„ 1985, Trends Pharm. Set. 6: 114-119). Also, complement is the primary mediator of 

25 the hyperacute rejection of xenogeneic transplants which limits the clinical and research usefulness of 
xenotransplantation (Ryan, U.S., 1994, Xeno 2: 19-22: Piatt, J.L et aL, 1990, Immunol. Today 11: 450456). 
Indeed, xenotransplantation as a viable surgical procedure will certainly require drugs that limit complement activation. 
Furthermore, therapeutic treatments of the reperfusion injury associated with myocardial infarction will likely include 
modulation of the effects of complement activation (Homeister, J.W. et aL, 1994, Annu. Rev. Pharmacol Toxicol. 

30 34:1740). 

Despite a clear need for complement modulators, most of the currently available methods for inhibiting or 
depleting complement are not suitable for clinical applications because of their toxicity, undesirable side-effects or 
lack of efficacy in plasma (Vogt, W., 1985, Trends Pharm. ScL 6:114-119). One complement inhibitor being clinically 
tested is soluble complement receptor type 1 (sCRI), which can suppress hyperacute rejection in xenotransplantation 
35 IWeisman, H.F. et aL. 1990. Science 249:146-151: Pruitt, S.K. et aL, 1994, Transplantation 57:363-370). sCR1 
inhibits the complex C3 and C5 activating enzvmes (convenases) by facilitating their dissociation. Although effective, 
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sCRI is a targe (240 kOa) protein and there is a need for small peptide inhibitors which are more easily synthesized, 
chemically modified, or mimicked by small organic peptidomimetic molecules (Nakanishi. H. et aL. 1993. Gene 137:51- 
56). 

The complement proteins C3, C4, and C5 are excellent targets for functional intervention by complement 
5 inhibitors because they play central roles in activation and regulation of this system. Although the three proteins 
have distinct functions, their amino acid sequences are closely related, and they are encoded by genes that doubtless 
evolved from a common ancestor (Campbell, R.O. et aL 1988. Ann. Rev. Immunol. 6:161-195). Because of their 
close structural relationship, the C3. C4 and C5 proteins form a protein family. All three proteins interact with a 
number of other proteins. In particular, C3 binds to or transiently interacts with more than a half-dozen soluble 

10 proteins including another molecule of itself during complement activation and attenuation, and to a similar number 
of ceO-surf ace-bound proteins, which mediate immune-clearance, inflammatory, and complement regulatory activities. 
Researchers have identified the locations of sites within C3, C4 and C5 that interact with other complement proteins, 
including those in C3 that are recognized by inactivating proteases and their cofactors, cell-bound receptors and 
catalytic subunits in the complex complement convertases (Alsenz, J. et aL, 1992, Dev. Comp. Immunol. 16:63-76). 

15 Insertion/Deletion Sequences in Protein Families 

During the evolution and divergence of proteins, individual members of a protein family undergo insertion 
or deletion of amino acid residues by corresponding insertion or deletion of ONA sequences in the genes encoding 
these proteins. This process results in length polymorphisms in the protein family. Because a deletion in one 
member of a family is equivalent to an insertion in others, these insertion/deletions have been referred to as "tndek" 

20 (Kruskal, J.B., 1 983, m Time Warps, String Edits and Macromolecules: Theory and Practice of Sequence Comparison. 
D. Sankoff and J.B. Kruskal ed., pp. 144). 

The locations of indels are revealed when amino acid sequences are aligned to maximize their sequence 
identity. Alignments of relatively short sequences can be carried out manually. Computer programs such as PILEUP 
(Genetics Computer Group, Madison. Wl) and CLUSTAL W (Thompson, J.D. et aL, 1S94, Nuc. Acids Res. 22: 4673) 

25 are useful for aligning multiple long sequences. In aligning related sequences of different lengths, gaps are introduced 
into one or more family members to optimize the alignment of the total sequences. These gaps are indels. 

Indels have been shown to occur in portions of the amino acid sequence of a protein that appear on the 
surface of the protein in its native fotded state. Generally indels are short (1 to 5 residues) and occur at the protein 
surface as reverse turns or coils within loops rather than within secondary structural elements (a-helices and /?- 

30 strands), because these properties minimize perturbations of the core protein structure (Pascareila, S. & Argos, P., 
1992. J. Mol. Biol. 224: 461471: Sibanda. B.L & Thornton. J.M.. 1993, J. Mol. Biol 229: 428447). Mutagenesis 
studies have experimentally confirmed that proteins are relatively tolerant of insertions within surface loops (e.g., see 
Freimuth. P.I. et aL. 1990. J. Biol. Chem. 265: 896-901). Even within secondary structural elements, insertions can 
be tolerated if they occur at the protein surface (Betton, J.-M. et at 1993, FEBS Lett. 325: 34-38). 

35 
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SUMMARY OF THE INVENTION 
This patent application describes a method for identifying likely regions of protein-protein contact based on 
comparison of protein sequences and describes examples of the use of the method for designing peptides that inhibit 
or enhance the action of the mammalian complement system. The method relies on identification of indels generally 
5 comprising selecting amino acid sequences for at least two proteins that contain similar amino acid sequences in at 
least a portion of the sequences and wherein one sequence is that of a target protein. The similar sequences include 
identical and/or conserved amino acid residues, and the similar sequences are aligned by matching the identical and/or 
conserved amino acid residues. Using the aligned sequences, sites are identified that contain insertions and/or 

A 

deletions of amino acid residues in one protein relative to another protein, wherein the insertions and/or deletions 
10 define an indel. An indel assoctated peptide sequence can either span or flank an indel. Indel associated peptides 
include peptides of about 4 to about 20 amino acid residues in length that are located within 30 amino acid residues, 
or preferably within 20, 15, 10, 9, 8, 7, 6 or 5 or less amino acid residues of an indel identified in the ammo acid 
sequence of a target protein. 

According to one aspect of the invention, there is provided a method for identifying molecules that affect 

15 biological activity of a target protein. The method includes the steps of obtaining information regarding the location 
of an indel in an amino acid sequence of a target protein, obtaining a peptide fragment of the target protein, the 
peptide fragment having a sequence that is located in the amino acid sequence of the target protein within 30 amino 
acids or less of the indel or obtaining a peptidomimetic or peptide analog of the peptide fragment, and screening 
the peptide fragment, the peptidomimetic, or the peptide analog for its affect on biological or biochemical activity 

20 of the target protein. In a preferred embodiment, the screening step includes analyzing for modulation of protein 
activity, inhfertion of protein activity, activation or potentiation of protein activity, competition for binding to a 
protein, binding to a protein or ligand, substitution for a substrate of the target protein, substitution for a ligand of 
the target protein, or making an anti peptide antibody capable of modulating a biological activity of the target protein. 
In one embodiment of the method, the peptide fragment, the peptidomimetic, or the peptide analog directly affects 

25 the target protein whereas tn another embodiment the peptide fragment, the peptidomimetic, or the peptide analog 
indirectly affects the target protein. Another preferred embodiment further comprises the step of synthetically 
constructing a peptide, peptide analog, or peptidomimetic that affects the biological or biochemical activity of the 
target protein. One preferred embodiment is a method for making a pharmaceutical composition, including the step 
of obtaining a molecule identified as having biological or biochemical activity in accordance with the steps of this 

30 method and combining the molecule with a pharmaceutically-acceptable carrier. This method may further include the 
step of packaging the molecule in unit-dosage form. In a preferred embodiment of the method, the target protein 
is a protein of the mammalian complement system. Preferably, the target protein which is a protein of the 
mammalian complement system is C2, C3. C4. C5 or Factor B. In a prefered embodiment, the peptide fragment has 
a sequence of about 4 to about 20 amino acid residues in length. In another embodiment, the peptide fragment has 

35 a sequence of about 10 to about 20 amino acid residues tn length. In one embodiment, the peptide fragment has 
a sequence of about 4 to about 15 amino acid residues tn length. In another embodiment, the peptide fragment has 
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a sequence of about 5 to about 18 amino acid residues in length. In one embodiment, the peptide fragment is 
located within about 20 amino acid residues of an indel. In another embodiment, the peptide fragment is located 
within about 15 amino acid residues of an indel. In yet another embodiment, the peptide fragment is located within 
about 12 amino acid residues of an indel. In one embodiment the peptide fragment is located within about 10 amino 
5 acid residues of an indel. In another embodiment, the peptide fragment is located within about 9 amino acid residues 
of an indeL In yet another embodiment, the peptide fragment is located within about 8 amino acid residues of an 
indel. In one embodiment, the peptide fragment is located within about 7 amino acid residues of an mdel. In one 
embodiment the peptide fragment is located within about 6 amino acid residues of an indel. In other embodiments, 
the peptide fragment is located within about 5 or less amino acid residues of an indel or has a sequence that spans 
10 the indel or has a sequence located within the indeL 

According to another aspect of the invention, there is provided a method of identifying interface peptides 
for a target protein, including the steps of identifying the location of an indel in the amino acid sequence of a 
target protein, selecting an amino acid sequence from the target protein sequence overlapping or located within about 
30 amino acid residues or less of an amino- or carboxyl-terminus of an indel, obtaining a molecule that is a peptide 
15 having the selected amino acid sequence, a peptide analog of the selected amino acid sequence, or a peptidomimetic 
of the selected amino acid sequence, and evaluating the peptide, peptide analog or peptidomimetic in an assay to 
measure a change in activity of the target protein, wherein the change in activity is mediated directly or indirectly 
by the peptide, peptide analog or peptidomimetic. In one embodiment of the method, the peptide having the selected 
amino acid sequence is about 4 to about 20 amino acid residues in length. In an embodiment of the method, the 
20 peptide having the selected amino acid sequence is about 10 to about 20 amino acid residues in length. In another 
embodiment, the peptide having the selected amino acid sequence is about 4 to about 15 amino acid residues in 
length. In another embodiment, the peptide having the selected amino acid sequence is about 5 to about 18 amino 
acid residues in length. In one embodiment, the selected amino acid sequence has an amino- or carboxyl-terminal 
residue within about 20 amino acid residues of one terminus of the indel. In another embodiment, the peptide is 
25 located within about 15 amino acid residues of an amino- or carboxyl terminus of an indel. In one embodiment, the 
peptide is located within about 10 amino acid residues of an amino* or carboxyl terminus of an indeL In another 
embodiment, the peptide is located within about 9 amino acid residues of an amino- or carboxyl-terminus of an indeL 
In yet another embodiment, the peptide is located within about B amino acid residues of an amino- or carboxyl- 
terminus of an indeL In one more embodiment, the peptide is located within about 7 amino acid residues of an 
30 amino- or carboxyl-terminus of an indeL In another embodiment, the peptide is located within about 6 amino acid 
residues of an amino- or carboxyl terminus of an indeL In one embodiment, the peptide is located within about 5 
amino acid residues or less of an amino- or carboxyl terminus of an indel. In a preferred embodiment, the method 
further includes the step of making antibodies to the peptide, peptide analog or peptidomimetic, wherein the 
antibodies are capable of modulating an activity of the targ protein. In a preferred embodiment, the change in 
35 activity of the target protein is a decrease in activity, an increase in activity, utilization of a substrate different than 
the substrate normally utilized by the target protein, or binding to a iigand differently than the ligand binding activity 
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ordinarily demonstrated by the target protein. Preferably, the target protein is a protein of the mammalian 
complement system and most preferably is C2, C3, C4, C5 or Factor B. 

According to another aspect of the invention, there is provided a peptide for modulating activity of the 
complement system of a mammal comprising a sequence of about 4 to about 25 amino acid residues that occurs 
5 in an amino acid sequence of a mammalian complement protein, the peptide having an amino acid sequence in which 
an amino- or carboxykermmal residue is located within about 15 amino acid residues of an indel of the mammalian 
complement protein. In one embodiment, the peptide modulates activity of the mammalian complement system by 
directly or indirectly inhibiting an activity of a protein of the mammalian complement system. In another embodiment, 
the peptide modulates activity of the mammalian complement system by directly or indirectly enhancing an activity 

10 of a protein of the mammalian complement system. In a preferred embodiment the indel occurs within the amino 
acid sequence of a C2, C3, C4, C5 or Factor B protein. In another embodiment, the peptide further includes another 
molecule attached to the peptide. Another embodiment is an antibody that specifically recognizes such a peptide 
with another molecule attached to the peptide. Another preferred embodiment is a peptide analog or peptidomimetic 
molecule of a peptide according to this aspect of the invention. Another embodiment is an antibody that specifically 

15 recognizes such a peptide analog or peptidomimetic molecule. Another preferred embodiment is a pharmaceutical 
composition including a peptide, peptide analog or peptidomimetic molecule according to this aspect of the invention. 
A preferred embodiment is a pharmaceutical composition including an antibody that specifically recognizes a peptide, 
peptide analog or peptidomimetic molecule according to this aspect of the invention. In a prefered embodiment the 
peptide is located within about 12 amino acid residues of an indel of the mammalian complement protein. In another 

20 embodiment, the peptide is located within about 10 amino acid residues of an indel of the mammalian complement 
protein. 

According to another aspect of the invention, there is provided a peptide having a sequence of any one of: 
SEQ ID NO:53, SEQ ID N0:54, SEQ ID N0:55. SEQ ID N0:56, SEQ ID N0:57, SEQ ID N0:58, SEQ ID N0:72, SEQ 
ID N0:73, SEQ ID N0:74 or SEQ ID N0:75. 
25 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows a prototypic indel. revealed by alignment of portions of the ammo acid sequences of the 
human ("hum") and mouse ("mus") C3, C4, and C5 proteins, using the alignment program CLUSTAL W. The aligned 
ammo acid sequences correspond to sequences around indel number 7 in FIG. 2A-2E and include humC3 (SEQ ID 
NQ:1), musC3 (SEQ ID N0:2). humC4 (SEQ ID N0:3], musC4 (SEQ ID N0:4), humC5 (SEQ ID N0:5) and musC5 (SEQ 
30 ID N0:6|. 

FIG. 2A-2E shows the alignment of the complete amino acid sequences of the complement proteins human 
C3 (hC3; SEQ 10 N0:7), mouse C3 (mC3; SEQ ID N0:8), human C4 (hC4; SEQ ID NO:9), mouse C4 (mC4; SEQ ID 
N0:10), human C5 (hC5; SEQ 10 N0:11] and mouse C5 (mC5; SEQ ID NO: 12) with indels of two or more residues 
labeled under the sequences; seouences selected for peptide synthesis are underlined and labeled as in Table 3. 
35 Double underlining indicates overlap of the peptide sequences. 
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FIG. 3A-3D shows the alignment of amino acid sequences of the ONA polymerases from herpes simplex virus 
(HSV) type 1 (HSV-1; SEQ ID N0:13), HSV type 2 (HSV 2; SEQ ID N0:14), HSV type 6 (HSV-6; SEQ ID N0:15}, 
baculovirus (Baculo; SEQ ID NO: 16), ictafurid herpes virus 1 (CCV; SEQ 10 NO: 17), Choristoneura biennis 
entomopoxvirus (ENTPOX; SEQ ID N0:18) and hepatitis B virus (HepB; SEQ ID N0:19). Asterisks C) mark identical 
5 residues and periods (.) mark conserved residues. 

FIG. 4A-4C shows the alignment of amino acid sequences of the human C3 (HuC3; SEQ ID N0:7), mouse 
C3 (MuC3; SEQ ID N0:8), rat C3 (RATC3; SEQ ID N0:20) and guinea pig C3 (GUIPIG; SEQ ID N0:21) proteins. 
Asterisks (*) mark identical residues and periods (.) mark conserved residues. 

FIG. 5A-5C shows the alignment of amino acid sequences of retroviral polyproteins of the immunodeficiency 
10 viruses of humans (HUM, SEQ ID N0:22; and HIV-2, SEQ ID N0:23), Simian (SimianiV; SEQ ID N0:24), chimp 
(ChimpIV; SEQ ID N0:25) and cat (FelinelV; SEQ ID N0:26), of Rous sarcoma virus (RSV; SEQ ID N0;27), Moloney 
murine leukemia virus (MOLONEY; SEQ ID N0:28) and Friend murine leukemia virus (F-MULV; SEQ ID N0:29). 
Asterisks (*) mark identical residues and periods (.) mark conserved residues. 

FIG. 6A-6B shows the alignment of part of the amino acid sequences of retroviral polyproteins of the 
15 immunodeficiency viruses of humans (HIV-1, SEQ ID N0:22; and HIV-2, SEQ ID N0:23), Moloney murine leukemia virus 
(MOLONEY; SEQ ID N0:28) and Friend murine leukemia virus (F-MULV; SEQ ID N0:29). Asterisks (*) mark identical 
residues and periods (.) mark conserved residues. 

FIG. 7A 7B shows the alignment of amino acid sequences of ribonucleotide reductases from HSV 1 (SEQ 
ID N0:30), HSV-2 (SEQ ID N0:31), Epstein-Barr virus (EBV; SEQ ID NQ:32). human (SEQ ID N0:33), vaccinia virus 
20 (Vaccinia; SEQ ID N0:34), mouse (Mus; SEQ ID N0:35). yeast (Yeast; SEQ ID N0:36) r Escherichia coB (Coli; SEQ ID 
N0:37) and Hemophilus influenzae (H. infl.; SEQ 10 N0:38). Asterisks (*) mark identical residues and periods (.) mark 
conserved residues. 

FIG. 8A-8B shows the alignment of amino acid sequences of herpes virus Vmw65 (VP 16) proteins for HSV1 
(HSV TYPE1/F; SEQ ID N0:39), HSV-2 (HSV TYPE2/HG52; SEQ ID N0:40), bovine virus (BvHV Type1/P8-2; SEQ 10 
25 NQ:41), Varicella-Zoster virus (Var-ZosV/Dumas; SEQ ID N0:42) and equine viruses (EqHV Type4, SEQ ID N0:43; and 
EqHV Typel/AB4P, SEQ ID N0:44). Asterisks (*) mark identical residues and periods (.) mark conserved residues. 

FIG. 9 graphically shows inhibition of complement activity (CH50) by the C3 peptides II 4 (•), II 5 (C. 
□) II-8C (0, O) and III- 1 1 (+, *) with K, values for the dotted line (....) being 4 X 10 s , for the uniformly 

dashed line (--) being 6 X 10 5 and for the solid line ( ) being 2 X 10 s . 

30 FIG. 10 shows the peptide concentrations giving 50% inhibition of complement function as measured by 

hemolysis for the individual peptides listed under the bars on the X-axis. 

FIG. 11AHB shows the alignment of amino acid sequences of human C2 (humC2; SEQ ID NQ:45), mouse 
C2 |musC2; SEQ ID N0:46), human factor B (humBf; SEQ ID N0:47), mouse factor B (musBf; SEQ ID N0:48) and 
zebrafish factor B (ZebBf; SEQ ID N0:49). 
35 FIG. 12 graphically shows the inhibition of complement hemolytic activity by the C2 peptide El for 

concentrations of peptide indicated on the X-axis used without serum (*) or with serum U, O and □). 
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DETAILED DESCRIPTION OF THE INVENTION 
A general method is disclosed for identifying portion of proteins that are involved in specific protein-protein 
interactions or contacts with other molecules. Identifying these interactive portions of proteins constitutes a critical 
step in designing small molecule inhibitors of protein-protein interactions. This method involves the analysis of only 
5 the primary structures of related proteins and has been demonstrated to be effective in predicting the amino acid 
sequences of inhibitory peptides for the complement system. 

In the absence of a three-dimensional structure for a protein, identifying the amino acid residues that 
constitute a protein's interactive sites presents a daunting challenge, particularly for large proteins. For example, 
for the complement proteins C3, C4 and C5. this requires identifying such sites among greater than 1600 amino acid 

10 residues in each protein. 

Nevertheless, using other more time-consuming methods, researchers have identified sites within C3, C4 and 
C5 that interact with other complement proteins, including those in C3 that are recognized by inactivating proteases 
and their cofactors, cell-bound receptors and catalytic subunits in the complex complement convertases (Alsenz, J. 
et al„ 1992, Dev. Comp. Immunol 16:6376). A key step in the development of the present method was the 

15 inventor's observation that almost all of these reported interactive sites span or lie within a few amino acid residues 
of primary structural features known as indels. The method described here for identifying regions of specific protein- 
protein contact relies on analysis of length polymorphisms and on determination of the locations of indels in protein 
f amffies. This general method provides a means for identifying interactive sites on proteins with a high probability 
of success. 

20 For purposes of this application, certain terms are defined as follows. 

"Identity" means that all of the amino acid residues at a single position are identical when two or more 
protein sequences are aligned. Two or more Drotein sequences may have only limited regions of identity and gaps 
may be introduced into one or more sequences to aid in alignment of identical residues. 

"Conserved" amino acids means that non-identical amino acid residues share sufficient chemical similarity 
25 that they can be considered to form a functional group. That is, when amino acid residues for two or more protein 
sequences are compared, a single position of alignment may contain non-identical amino acids but the residues may 
be conserved for that position and functionally similar because of their chemical and/or structural similarity. For 
purposes of this application, the following groups of amino acids (indicated by their three letter code followed by 
their one letter code in parentheses) form groups of conserved amino acids: 
30 Phe (F) and Tyr (Y); 

He (I), Leu (L). Val (V) and Met (M): 
Ala (A), Ser (S) and Thr (T); 
Asn (N), His (H), Arg (R), Lys (K) and Gin (Ql; and 
Asn (N), Asp (0). Glu (E) and Gin (Q). 
35 For comparison of two or more protein sequences, conserved amino acids for any given position means that all of 
the amino acid residues at a single position are members of a single group of conserved amino acids although any 
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combination of amino acids within that conserved group is permissible. The definition of conserved residues used 
here follows closely that defined by the PAM250 matrix (Dayhoff f M.O., et al. 1978. In Atlas of Protein Seq uent 
and Structure. Vol. 5. suppl. 3 (Dayhoff, M.O., ed) p. 345, NBRF, Washington, DC). 

"Alignment" of amino acid sequences allows for insertion of spaces in one or more of the sequences of 
5 amino acid residues to maximize the number of positions having identical and/or conserved residues in the compared 
protein sequences using methods well known in the art. For purposes of alignment of more than two sequences, 
the identical or conserved residues do not have to be present in all of the sequences for any particular location (Le„ 
an identical or conserved residue may be identified even if it is identical or conserved for that locus only for two 
of the compared sequences). It will be understood that in an alignment of two or more similar sequences there can 
10 be a mixture of identical, conserved and non-identical nonconserved amino acid residues in the aligned sequences. 
The comparisons resulting in alignment can be done manually or using computer software programs such as, for 
example, the CLUSTAL W and PILEUP programs. 

An "interface peptide" is a peptide that includes an amino acid sequence that occurs in a protein at or near 
a specific protein-protein contact region. An interface peptide can specifically inhibit a protein-protein interaction by 
15 acting as a competitive inhibitor of the native protein or can serve as a model for making a modified peptide, a 
peptidomimetic or anti-peptide antibodies. 

An "indel" is a region of amino acid sequence that includes all "deleted" residues as shown in at least one 
amino acid sequence when aligned with another amino acid sequence(s), where the sequences share overall similar 
sequences of amino acid residues. That is, the compared sequences contain some identical or conserved ammo acid 
20 residues occurring at the same or approximately the same positions in the compared protein sequences. An indel 
spans the farthest extent of deleted (or inserted) amino acid residues as discussed in detail below and shown in FIG. 
1. In cases where insertion/deletions are less than ten residues apart (e.g., see indel 13 in FIG. 2A-2E), the entire 
segment containing the multiple insertions/deletions is regarded as a single indel. Indels can also occur at the amino- 
and carboxyl-terminal ends of a protein. 
25 A "divergent region" is a portion of an amino acid sequence which shows dissimilarities in the amino acid 

residues of at least two members of a protein family whose amino acid sequences are compared. The divergent 
region is bordered by segments of amino acid sequences having high sequence conservation among the family 
members. An indel is the focal point of a divergent region. That is, a divergent region is a segment of the amino acid 
sequence including the indel that is relatively polymorphic among the compared sequences and is bounded by highly 
30 conserved regions. 

When used to refer to the location of an amino acid residue of a peptide relative to an indel, "located 
within" means that at least one residue of the peptide is contained within the length of the sequence from at least 
one end of the indel For example, if a peptide is located within 30 ammo acids of an indel, at least one residue 
of the peptide is located within the sequence defined by residues 1 to 30 away from the indel. That is, at least 
35 one residue of the peptide is located between tne end of the indel and the residue located 31 residues from the end 
of the indel. 
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Ref erring to FIG. 1, an indel is shown as the region that includes all "deleted" residues, shown by spaces 
occupied by asterisks (*), with the indel being the focal point of a divergent region among the protein family members 
which is bordered by segments having high sequence conservation among the family members. The spaces (shown 
by asterisks (*}) are introduced into the amino acid sequences to maximize the alignment of the flanking amino acid 
5 residues of the related proteins. As defined herein, the indel shown spans the farthest extent of deleted (or inserted) 
amino acid residues, shown by the arrow symbols (" < " and ° > "), with "N n and "C" under the arrow symbols 
identifying tne amino and carboxyl termini of the indel. respectively. In FIG. 1, the position marked with an arrow 
( 4 ) is defined as being six residues away from the ammo terminal end of the indel. This figure compares small 
portions of the human and mouse amino acid sequences of the complement proteins C3, C4 and C5. Because of 
10 the multiple interactive properties of these proteins and their close sequence similarity, this protein family is used 
to demonstrate the utility of the disclosed method for identifying regions of proteins that are involved in interactions 
with other proteins. 

FIG. 2A-2E shows the alignment of the complete amino acid sequences of human and mouse C3, C4 r and 
C5 proteins, which reveals the positions of indels in this protein family. The residues shown in lower case tetters 
15 in C3 and C4 have previously been identified by other methods as interactive sites in these proteins. Some of these 
sites are listed in Table 1 with summaries of the functional features, thus showing that the reported interactive sites 
are generally indel-proximal as defined herein. 
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Table 1. Interactive sites near indels in the C3/C4/C5 protein family. 

Indel Mo. Functional Feature 

5 13-14' Activating protease cleavage site in C3, C4, and C5; located between indels 13 and 14 

are the binding sites for C3 or its proteolytic fragments with the following proteins: 
Factor B 
Factor H 

Complement receptor 1 (CRD 
Complement receptor 2 (CR2) 
Complement receptor 3 (CR3) 
Membrane cofactor protein (MCP) 

16' Specific protease cleavage sites in C3b, giving C3dKfdg 

Factor B binding site 

21' Factor H binding site 

CR2 binding site 

Cross-linking site of C4b to C3b thioester in the C5 convertase 

22 Specific inactivating proteolytic sites in C3b and C4b 

26 Properdin binding site 

10 * Indels that occur at intron/exon junctions. 

The occurrence of indels primarily as loops on protein surfaces led me to hypothesize that indels are in 

general involved in the interactions of proteins with other molecules. Hence, in the absence of an NMR or X-ray 

crystal structure of the protein, indels provide focal points for identifying amino acid residues within a protein 

sequence that are likely to be involved in interactions with other molecules. The disclosed method for identifying 

15 likely sites of protein-protein interactions is based on alignments of the amino acid sequences of related proteins and 
identification of indels in the protein family. Protein sequences in databanks such as GENBANK and SWISSPROT 
are readily available to those skilled in the art for use with this method which provides a rapid and effective 
approach to defining sites of protein-protein interactions compared to currently available procedures. This method 
has been used to identify linear amino acid sequences of peptides that significantly affect (inhibit or enhance) 

20 complement activity. 

The method for identifying protein-protein and protein-substrate interactive sites includes the steps of 
aligning the amino acid sequence of the target protein of interest with the sequences of one or more closely related 
proteins, and identifying indels in the protein family using this alignment. Because indels usually mark protein 
segments at the surface of a naturally folded protein, they also mark regions of potential protein interactive sites. 

25 Although only a subset of indei-nrnyimai regies may actually be involved in a particular protein-protein interaction, 
focusing on indels greatly reduces the task of finding the desired contact site(s) compared to currently used methods. 
Involvement of a particular indel-proximal region in protein-protein binding may be tested in several ways. For 
example, synthetic peptides containing bidet-associated sequences may be constructed and tested for modulating a 
particular protein-protein interaction by measuring the activity of the protein of interest, in the presence and absence 

30 of peptide, in an assay in which that interaction is obligatory. Alternatively, the peptides may be used to elicit anth 
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peptide antibodies which are similarly tested. The peptide may also be used as a structural model for construction 
of modified peptides or peptidomimetics which are tested in the same manner. Peptide-based tests for involvement 
of a particular indel-proximal region may fail to reveal interface peptide activity even if the region is involved in a 
protein-protein interaction because the assay may require the peptide to be in a three-dimenstonai conformation like 
that of the corresponding segment within the native protein. Additional methods for rndel testing include engineering 
proteins with sequence modifications at the indel. No matter which assay is chosen, focusing on indels greatly 
simplifies the search for interactive sites. 

Because a deletion in one member of a family is equivalent to an insertion in another and because the size 
of an indel may vary from one to about 50 amino acid residues, the location of an indel is defined as follows. From 
the point of view of an amino acid sequence containing a deletion relative to another protein (defined as the insertion- 
containing member) with which it shares a similar amino acid sequence, the indel site begins at the first residue that 
is missing in the deletion-containing member(s) relative to the insertion-containing member(s) and ends at the last 
residue that is missing in the deletion-containing member(s) relative to the insertion-containing member(s) (illustrated 
in FIG. 1). For a protein family in which an indel consists of a single amino acid insertion in one protein, the indel 
will be a single amino acid residue in length. For a protein family in which an indel consists of two or more residues, 
the indel will have amino and carboxyl termini (as indicated with the R N n and "C n in FIG. 1). The length of the indel 
is irrelevant to its identification or its predictive utility for identifying indel-associated peptides. Peptides associated 
with such indels may be proximal to either the amino terminus or the carboxyl terminus or may span the indel. Indel- 
associated peptides that flank an indel do not necessarily begin or end immediately adjacent to the amino- or 
carboxyl-terminus of the indel but can have an amino- or carboxyl-terminus within about 30 amino acid residues of 
the indel. 

In cases where insertion/deletions are less than ten residues apart and contain identical or conserved amino 
acid residues between them (e.g., indel 13 in FIG. 2A-2E), the entire segment is regarded here as a single indel. 
Indels can also occur at the amino- and carboxyl termini of a protein as indicated by length polymorphisms at the 
termini of related proteins (e.g., the carboxyl-termini of the protein sequences shown in FIG. 7A-7B). 

The process of identifying indels involves selection of sequences to be aligned, for example, from databanks 
such as GENBANK. Selection of sequences may be based on functional similarities of proteins, genetic relatedness, 
random scanning of database sequences for related sequences or other well-known molecular biology methods or 
combinations of methods. The selected sequences are aligned usually by using computer programs (e.g., PILEUP or 
CLUSTAL W programs) but may also be aligned manually. Although this process adequately identifies the positions 
of most indels. it will be understood by those skilled in the art that the location of some indels may be somewhat 
unsystematic depending on the method (e.g., algorithm) used to align the amino acid sequences and on the particular 
protein family members selected. This variability is most pronounced in regions of limited sequence similarity. Hence, 
avoiding unnecessary sequence dissimilarity, by careful selection of the individual protein family member sequences 
to be aligned, helps to minimize ambiguities in indel locations. Even if the exact indel location is uncertain, the 
method nevertheless gives an approximate location of a potential protein interactive site. 
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In selecting individual protein family memoers for comparison, sequence similarity and the functional 
properties of the proteins should be considered. The proteins must have sufficiently similar sequences to provide 
regions of unequivocal sequence alignment but must contain the length polymorphisms caused by insertions and/or 
deletions. For example, in the alignment of DNA polymerases shown in FIG. 3A-3D. the sequence similarities among 
5 the proteins is poor. Thus, the number and precise locations of indeis is ambiguous. In contrast, the alignment of 
the very similar C3 proteins from different species in FIG. 4A-4C shows only a single indel greater than one residue 
in length. Therefore, the alignment of C3 with C4 and C5 in FIG. 2A-2E is more informative of potential locations 
of interactive sites in C3 and functionally important sites that are unique for each member of this family. 

Specific types of protein-protein interactions may be examined by the indel method by appropriate choices 
10 of sequences that are compared (e.g., species-specific protein-protein contact sites as shown in FIG. 4A4C). Because 
it is known that some complement components are incompatible with components from other species (e.g., see Kai, 
S. et al., 1980, J. Immunol. 125:2409*2415), the single large indel in the C3 alignment of FIG. 4A4C may identify 
an interactive site that mediates such a species-specific interaction. Similarly, alignments of related protein 
sequences from closely related viruses having different tropisms may identify tropismrelated interactive protein sites. 
15 Alignment of sequences for homologous proteins from different tissues may reveal tissue-specific functional regions 
of the proteins. 

The selection of proteins for alignment can alter the appearance of indeis as shown in FIGS. 5 and 6, in 
which different subsets of retroviral polyproteins are aligned. As shown in FIG. 5A-5C, when all eight sequences 
are included in the analysis, the number and locations of indeis are somewhat ambiguous. In contrast when only 
four of these sequences are compared in FIG. 6A-6B the indeis become more clearly defined. These alignments 
demonstrate that for the clearest results, the sequences chosen must be similar enough to provide regions of 
significant sequence similarity. One skilled in the art can readily select sequences for making meaningful indel 
comparisons with a minimum of experimentation. 

The nature of the protein-protein interaction is another important consideration in using this method. For 
25 example, amino acid sequences involved in obligate interactions (e.g., the a and /? subunits of hemoglobin) tend to 
show less sequence variation and length polymorphism than segments involved in transient protein-protein 
associations. Furthermore, inhibitor-mediated disruption of protein-protein contacts in obligate complexes is generally 
difficult because, unless the complex is kinetically relatively unstable, the inhibitor can act only if it is present before 
or during complex formation. Peptidomimetics that readily enter the ceil may be used in these situations as is well 
30 known in the an. 

One aspect of this invention is that indeis are used as guides in designing synthetic interface peptides that 
modulate the interaction between two proteins. Although a practitioner skilled in the art may use personal discretion 
in selecting interface peptidels). the following two guidelines are useful. First, utilize sequences contained within the 
region of sequence divergence surrounding each indel as described above and illustrated in FIG. 1. That is, use only 
35 indel and flanking divergent sequences bounded by highly conserved sequences. I picture the divergent region as 
being at or near the protein surface, and hence that its sequence can vary without greatly perturbing the protein 



20 



SUBSTITUTE SHEET (RULE 26) 



WO 97/01578 



PCT/US96/10958 



-14- 

conformatton and is available for interactions with other proteins, tn contrast, the conserved residues are inferred 
to be invariant because they are closer to the protein core, taking part in intramolecular interactions that maintain 
the overall protein conformation and are physically less available for tntermolecular interactions. Second, select 
peptides that correspond in sequence to segments within the divergent region that are intermediate in hydropathy 
5 (i.e., amphipathic peptides). That is, those peptides having hydropathy between about 0.25 and 0.65 when calculated 
as the mean of the amino acid parameters as described by Fauchere, J.L., et al., 1983, Eur. J. Med. Chem. 10:369. 
Hydropathy can be determined by any of a number of well known methods. As shown in Tables 3 and 4 (discussed 
below), most effective peptide inhibitors have intermediate hydropathies. Although not all peptides with intermediate 
hydropathies show inhibitory activity (e.g., peptide 1-61 none of the strongly hydrophilic peptides (hydropathy less 

10 than 0.25) showed 50% or more inhibition in an in vitro assay at concentrations of less than or equal to 150 |/M 
(discussed in detail in Example 3). 

Using the indel method, one skilled in the art can select segments of the target protein sequence for 
designing and synthesizing peptides, peptide analogs and peptidomimetics. Indel associated peptides include peptides 
of about 4 to about 20 amino acid residues in length that are located within 30 amino acid residues, or preferably 

15 within 20, 15, 10, 9, 8, 7, 6 or 5 or less residues of an indel identified in the amino acid sequence of a target 
protein. In selecting the amino acid sequences for making peptides (and for designing peptide analogs or 
peptidomimetics of the peptides), certain combinations of peptide length and distance relative to the indel are 
preferred. A peptide length of about 20 amino acid residues, located within about 10 residues of an indel is 
preferred, and a peptide of about 18 amino acid residues, located within about 8 residues of an indel is particularly 

20 preferred. Most preferably, the selected peptide includes about 15 amino acid residues and is located within about 
6 residues of an indel. 

Once an interface peptide is identified using the method and guidelines, it may be modified with respect 
to length and sequence for maximum effect using techniques well known to those skflled in the art. Modification 
by inclusion of conserved residues, presumably from the core of the folded protein, may be desirable to stabilize the 

25 conformation of the peptide in a form similar to that of the native protein. Peptides may also be acetylated at their 
amino terminus and/or amidated at their carboxyl terminus. Standard solid-phase peptide synthetic techniques allow 
for essentially unlimited quantities of the synthesized peptide of interest to be chemically produced (e.g., see Erickson 
& Merrifield. The Proteins, 3rd Ed.. Vol. 2. Chapt. 3, Academic Press, New York, 1976; and Merrifield & Barany, 
The Peptides: Analysis. Synthesis. Biology . Vol. 1. Chapt. 1, Gross & Meinenhofer. eds.. Academic Press, New York, 

30 1980). 

Peptide analogs and peptidomimetics are molecules that are modeled on a known peptide and synthesized 
generally to force a desired conformation on a peptide or peptide like molecule by introducing conformational 
restraints (McDonnell, J.M. et al., 1996, Nature Structural Biol. 3(51:419-426; Nakanishi, H. et al., 1993, Gene 
137:51-56; Liuzzi, M. et al.. 1994. Nature 372:695-698). That is. a successful peptide analog or peptidomimetic 
35 includes appropriate functional groups positioned on a relatively rigid molecular framework. Conformational restraints 
may be introduced in a variety of ways, all well known in the art. including using ammo acid residues that display 
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strong conformational tendencies, covalently cyciizing a peptide backbone (e.g., by introducing a disulfide bond 
between two cysteine residues in a peptide sequence), and introducing bulky chemical groups as side chains or as 
terminal modifications to a peptide (e.g., introducing one or more (C 6 H 5 )CH ? groups at the amino-terminus of a 
peptide). 

5 Peptide analogs and peptidomimetics mav be designed in a variety of ways yielding molecules that mimic 

the peptide of interest. One type of analog consists of the amino acid sequence of the identified peptide of interest 
with cysteine residues added at the amino- and carboxyl termini of the peptide. The resulting peptide is then oxidized 
to join covalently the terminal cysteine sulphydryl side chains, producing intrachain disulfide bridge and forming a 
cyclic peptide. Moreover, the corresponding linear version of the peptide is easily produced by reducing the disulfide 

10 bond with a reducing agent such as dithiothreitol (DTT) or by reductive alkylation with vinylpyridine (McDonnell J.M. 
et aL, 1996, Nature Structural Biol. 3(5):41 9-426). 

Peptide analogs may contain some or all Damino acid residues substituted for the L-amino acid residues 
in the peptide sequence of interest. D-amino acid containing peptides are often preferred for in vivo use because 
of their greater serum stability compared to peptides containing L-amino acids. One type of D-amino acid containing 

15 peptide analog is a retro-enantiomeric peptide consisting of only D-amino acid residues occurring in the reverse order 
of the sequence of L-amino acids in the peptide of interest. Such a retro-D-amino acid peptide is an isomer of the 
corresponding L-amino acid peptide, having reversed peptide bond orientation but theoretically identical side-chain 
topology as in the corresponding L-amino acid peptide. (McDonnell, J.M. et al., 1996, Nature Structural Biol. 
3(5):419-426; Guptsarma, P., 1992, FEBS Lett. 310:205-210; Goodman, M. & Chorev, M., 1979, ACC Cheat Res. 

20 12:1-7). 

Another type of peptide analog retains the ammo acid composition of the peptide of interest but consists 
of a "scrambled" amino acid sequence having any one of the finite number of possible combinations for that amino 
acid composition. One type of peptidomimetic uses structures within the peptide sequence that mimic the reverse- 
turns that occur naturally in proteins. That is, the peptide includes a /?*turn that causes the peptide chain to reverse 

25 direction, for example, by limiting the distance between the Ca of the first residue and Ccr of the fourth residue to 
about 4 to 7 A. Such peptides are readily produced by a macrocyclization reaction involving the use of an 
azetidinone as an activated ester during peptide synthesis INakanishi, H. et aL, 1993, Gene 137:51-56). Similarly, 
peptides containing a d-turn can also be synthesized to produce d-turn peptidomimetics. 

Combinations of any of these technioues may also be used to produce peptide analogs and peptidomimetic 

30 molecules. For example, a peptide analog may be produced by using a scrambled sequence of the peptide of interest 
and introducing one or more D-amino acid residues into the peptide. 

Synthesis of peptide analogs and peptidomimetics is well known in the art and generally involves: (1) 
molecular modeling of a known peptide sequence which can be performed using any cf a variety of molecular 
modeling computational chemistry methods (e.g., those available as Discover™ and Homology™ from Biosym 

35 Technologies Inc. or the PROCHECK program of Laskowski. R.A. et al., 1993, 1 AppL Crystallogr. 26:283-291), and 
(2) chemical synthesis using well known peptide synthesis or organic synthesis techniques. Synthesis of peptide 
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anaiogs or peptidomimetics generally uses modifications of solid-phase peptide synthesis protocols (see, e.g., the 
synthetic methods discussed in Nakanishi. H. et aL 1993. Gene 137:51-56 and McDonnell J.M. et at 1996, Nature 
Structural BioL 3(51:419426). For example, peptide analogs and peptidomimetics may be produced using standard 
protein synthetic chemistry on an automated system (e.g., Applied Biosvstems 430A peptide synthesizer) but including 
5 one or more D-ammo acid residues or bv modifying the synthesis to include a macrocyclization reaction as discussed 
above. The synthesized peptides are then readily purified by any of a variety of techniques such as reverse phase 
high pressure liquid chromatography (RP HPLC). Screening of existing chemical libraries or natural products for 
molecules that mimic the peptide of interest is an alternative method of identifying peptidomimetics without 
synthesizing the molecules de novo, 

10 Using the guidelines presented in this disclosure for identifying indel-associated peptides greatly aids one 

in producing peptides, peptide analogs or peptidomimetics with biological activity. Nonetheless, even similar peptides 
associated with an indel can vary considerably in their activity. As shown in Table 2, the inhibitory activities of four 
overlapping peptides that were selected as candidate inhibitors based on their proximity to indel 25 (see FIG. 2A-2E) 
varied from very effective (peptide III 80 to relatively ineffective ('peptides II8A and III8B) inhibitors, while one 

15 peptide (It-8) was difficult to chemically synthesize and purify. Despite these difficulties, the method permitted rapid 
identification of one interface peptide from four candidate sequences. Thus, the method permits relatively efficient 
identification of active interface peptides with a minimum of empirical testing compared to other methods of 
searching for interactive sites and functionally active peptides. 

Table 2. Complement inhibitory activity of peptides having 
20 indel 25-proximal C3 segments. 

SEQ ID PEPTIDE SEQUENCE IH^ 



NO: (jua) 

25 50 II- 8 Ac - D ATMS I LD I S MMTG NT" 

51 II-8A Ac - D ATMS I LD I SMMTG - NH 2 > 300 

52 III-8B Ac - DQDATMS I LD I SMM - NH 2 > 400 

53 III-8C Ac - S ILD I SMMTGFAPDT - NH 2 60 



30 

* Not tested ("NT") because the peptide synthesis failed. 
Arrow (T) shows location of indel. 

35 

The identification of candidate indelproximal interface peptides is inexact but can readily be determined by 
a practitioner skilled in the art using the techniques de?" ibed herein. Using this method, the amount of 
experimentation needed to confirm the in vitro or in vivo auivity of potential interface peptides is significantly 
reduced because of the relatively high probability of predicting interface peptide sequences. In practice, the activity 
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of an individual peptide, peptide analog or pepttdomimeiic may depend on its three-dimensional conformation, its 
solubility in physiological conditions and its ability to be chemically or biologically synthesized and purified. These 
synthesis, purification and assay techniques, however, are well known in the art and the activity of individual 
peptides, peptide analogs and peptidomimetics can be readily determined using standard techniques. 
5 The utility of the present method for identifying interface peptides is supported by previous studies that 

have identified interface protein sequences using other methods. That is, the indel method described here has been 
applied retrospectively to active interface peptides thai were identified using other methods. Where interface 
peptides have been shown to inhibit a protein-protein interaction, the inhibitory peptides are indel-proximal as defined 
by the present method. These include interface peptides that have been used to inhibit (1) the HIV protease (Zhang, 

10 Z.Y. et al., 1991, J. Biol. Chem. 266:15591-15594; Babe, L.JVL et al., 1992, Prot. Scl 1:1244-1253; Schramm 
HJ. et al., 1993, Biochem. Biophys. Res. Commun. 194:595-600), (2) HIV reverse transcriptase (Divita, 6. et aL 
1994, J. Biol. Chem. 269:13080-13083), the (3) herpes virus and human ribonucleotide reductases (Dutia, B.M. et 
aL, 198B, Nature 321:439-441; Cohen, E.A. et al., 1986, Nature 321:441443; Cosentino. G. et al., 1991, Biochem. 
Cell. Biol. 69:79-83), and (4) the herpes virus Vmw65 transcriptional control protein (Haigh, A. et al., 1990, Nature 

15 344:257-259). These inhibitory peptide sequences had been selected on the basis of X-ray crystaliographic structural 
information, by fortuitous observation of inhibitory activity and analogy to related proteins, and by individual testing 
of modified proteins. 

When the sequences of the above four proteins were aligned (using the CLUSTAL W program) with those 
of related proteins to display indels, in all cases the inhibitory interface peptides corresponded in sequence to 

20 segments near indels as illustrated in FIGS. 6 to 8. Only a single previously reported interface peptide inhibitor could 
not be unequivocally associated with an indel: the peptide of the herpes virus DNA polymerase identified by Digard, 
P., et al., 1995, Proc. Natl. Acad. Sci. USA 92:1456-1460. The sequence of this polymerase is shown aligned with 
sequences of related polymerases in FIG. 3A-3D which shows that the sequences chosen for comparison are too 
dissimilar to give reliable indel locations thus precluding association of the interface peptide with an indel. Hence. 

25 with one possible exception, in previously published cases where interface peptides have been used successfully, 
these peptides are indel-proximal as defined by the disclosed method. These observations, together with the success 
in identifying complement inhibitory peptides based on indel proximity described in the Examples below, support the 
validity of the indel method for identifying interface peptides. 

The examples focus on identifying inhibitory peptides based on analysis of indels of the complement 

30 C3/C4/C5 and C2/factor 8 protein families. The C3/C4/C5 family has well-defined structural characteristics useful 
for confirming the efficiency of the method for active peptide identification. It will be understood by those skilled 
in the art that these nonlimiting Examples provide a foundation for identifying active peptides for other protem protetn 
interactions for which primary amino acid sequence information is known which would allow one to identify indels. 
< such as, for example, the protein families illustrated tn FIGS. 6 through 8. 

35 The present invention includes a useful method for identifying peptides that can inhibit the interactions of 

proteins with other molecules including substrates and other proteins, such as in enzymatic protein-protein 
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inieractions. This can be especially important in modulating activation of proteins involved in complex biological 
processes such as complement activation. By modulating complement activation, inhibitory peptides may be useful 
for preventing and treating human pathotogtcai conditions associated with activation of the complement system 
including tissue rejection associated with xenotransplantation of organs, limb and gut ischemia, ischemia-reperfusion 
5 following myocardial infarction, stroke, aneurysm, hemorrhagic shock, and crush or thermal injury, anaphylaxis, and 
any of a variety of chronic inflammation conditions. Autoimmune disorders associated with increased complement 
activity, including systemic lupus erythmatosis. rheumatoid arthritis and multiple sclerosis may also be regulated by 
inhibitory peptides or antibodies generated using peptides Designed by the tndel method. Thus, these new 
therapeutics may improve the current treatments of these pathological conditions. 

10 For formulations of a pharmaceutical composition containing a biologically active peptide, peptide analog 

or peptidomimetic compound identified by identifying an indel-associated peptide sequence, an effective amount of 
the peptide, peptide analog, peptidomimetic or mixtures thereof, is admixed with a physiologically acceptable carrier 
suitable for administration to mammals including humans. The peptides, peptide analogs or peptidomimetics may be 
covalentty attached to each other, to other peptides or protein carriers or to other carriers such as by incorporation 

15 into lipid vesicles. Moreover, for peptides, analogs or peptidomimetics that mediate their biological effect indirectly 
by eliciting an anti peptide immune response in the recipient mammal, the peptide, peptide analog or peptidomimetics 
are mixed with an adjuvant or adsorbent as is well known in vaccine art. The peptides, peptide analogs and/or 
peptidomimetics may be delivered to the mammal to be treated in any of a variety of known methods including but 
not limited to systemic delivery via i.v., i.m., i.d. or s.c, or i.p. injection of a solution, suspension or lipid 

20 encapsulated form. 

Although the examples show use of the method to identify inhibitory peptides, it will be appreciated that 
the indel method may also be used to look for peptides that potentiate or enhance protein-protein interactions. 
Because peptides near to or overlapping with an tndel sequence represent sections of protein that have a higher 
potential for being involved in protein-protein interactions, such peptides may also be used to potentiate any reaction 

25 dependent on protein-protein interaction. One such example of a complement potentiating peptide is peptide C5-D2 
discussed in Example 3. Although the mechanism of potentiation is unknown, one explanation is that the peptide 
interferes with binding of a complement down-regulatory protein, resulting in increased complement activity. 

One general mechanism of peptide medtaied activity is that an indei proximal peptide selected for a target 
protein binds to a receptor of that target protein and acts either as an agonist or an antagonist of the normal cellular 

30 response associated with protein-receptor binding. For example, a growth factor peptide identified by its indel 
proximity may instigate or potentiate cell division when the peptide binds to the cellular receptor tor the growth 
factor protein. The indel method is used to select and design peptides, peptide analogs and peptidomemetics that 
are then tested for their ability to potentiate or enhance cellular responses. Such testing involves the use of well- 
known in vivo or in vitro assays available for a wide variety of known cellular responses. 

35 A variety of animal models are available to test the efficacy of peptides identified by the indel method for 

their relevance to the treatment of human medical conditions. These include in vivo animal models include the 
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following nonlimiting models for: acute myocardial infarction (Weisman. H.F. et aL 1990, Science 249: 146); rejection 
of xenograft transplants (Leventhal, J. et aL, 1993, Transplantation 55: 857); ischemia related to stroke (Chang, L 
ex ai.. 1992, J. Cereir. Blood Flow Metab. 12: 1030); cardiopulmonary bypass (Nilsson, L et aL, 1990. Artif. Organs 
14: 46); pancreatitis (Steer, M.. 1992. Yale 1 Biol Med. 65: 421) and nephritis (Picler. R. et aL, 1994, Am. J. 
5 Pathol 144: 91 5). Ex vivo perfusion of animal organs (e.g., heart) may serve as a measurement for peptide inhibition 
of complement-mediated tissue destruction during xenotransplantation (Morgan, B.P., 1995, Immunol. Today 16: 257). 

Inhibitory peptides, peptidomimetics, antibodies to peptides or combinations thereof may be administered 
by a variety of methods well known in the art including intravenous injection, oral and intranasal routes, 
intraperitoneal, intradermal, intramuscular and subcutaneous injection. A variety of delivery systems may be employed 

10 including injection of pharmacologically acceptable solutions or suspensions, encapsulation in liposomes or by 
controlled release methods well known in the art. In vitro treatment of donor organs with prior to transplantation 
into a recipient is contemplated. Pharmaceutical formulations containing peptides, peptidomimetics, antibodies to 
peptides or combinations thereof in the range of about 10 //g/kg to 1 g/kg may include other active ingredients 
including antibiotics, immunosuppressive drugs and growth factors. Animals made transgenic for inhibitory peptides 

15 where the peptide is expressed in the animal's organs may also serve as a source of organs for xenotransplantation 
to avoid hyperacute rejection (Morgan, 6.P., 1995, Immunol Today 16: 257). 

The disclosed method is a general method for identifying potential interactive sites in a protein and for using 
this information to guide the design of peptides, peptide analogs, or peptidomimetics that inhibit, enhance, or mimic 
the activity of a target protein. The method includes the following steps. 

20 First, the known amino acid sequences of two or more proteins that share similar amino acid sequences 

are compared as described earlier. One of the amino acid sequences is that of the target protein for which 
modulating/mimicking peptides are sought. The similarity between the protein sequences based on sequences of 
identical and/or conserved amino acid residues between proteins for comparison should be in the range of about 
0.25% to about 70% for identity, about 1% to about 20% for conserved amino acid residues and about 1% to about 

25 80% for a combination of identical and conserved amino acid residues. The preferred ranges are about 3% to about 
25% sequence identity and most preferably about 5% to about 15% sequence identity between the compared amino 
acid sequences. Preferably the compared sequences have about 4% to about 15% conserved amino acid residues 
and most preferably about 8% to about 12% conserved amino acid residues. The compared sequences have 
preferably about 7% to about 50% for a combination of identical and conserved amino acid sequence and most 

30 preferably about 12% to about 30% for the combined identical and conserved amino acid residues. It will be 
understood that these numbers represent the average of identical and/or conserved amino acid residues over the entire 
amino acid sequences of the compared proteins and that limited portions of the sequences may have greater or lesser 
percentages of identical or conserved amino acid residues. 

After aligning the sequences, the indel(s) in the group of proteins are identified. From these identified indels, 

35 a peptide sequence that corresponds to a segment of the protein sequence and includes amino acid residues that 
span or occur within about one to about ten residues of an amino terminus or a carboxyl terminus of an indel are 
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selected. Peptides are selected using the two guidelines described earlier, and are synthesized using any of a variety 
of molecular genetic and chemical methods. Using well-known assays, the effects of the peptides on the activity 
of the target protein or proteins that interact with the target protein are evaluated. 

It will be appreciated by one skilled in the art that proteins may share limited similarity when their total 
5 amino acid sequences are compared, but may still be related for portions of their amino acid sequences. For such 
proteins, this method would still be useful for identifying peptides associated with indels in those portions of the 
proteins that share greater amino acid sequence similarity compared to the dissimilar portions of the proteins. Thus, 
although the total percentage of identical or conserved amino acid sequence between two or more proteins may be 
relatively low. alignment of even limited portions of the proteins may identify indels useful for identifying peptides 

10 or other molecules that can affect the activities of the proteins. Indels are highly variable in size, ranging from one 
to about 50 residues in length. The length of the indel is not relevant; the identification of an indel, no matter how 
short or long, is a key step leading to selection of indel-proximal peptides. To be effective, the selected peptide 
sequence may range in size from about 4 to 25 amino acid residues in length including peptides of about 4 to 15 
or 4 to 20 amino acid residues in length. 

15 A peptide of any length whose amino and carboxy termini are both within about 15, 12 or 10 residues of 

an indel is included within the scope of this invention if the peptide sequence spans a large indeL For example, 
hypothetical protein sequences "A" and "8" are shown at an indel (indicated by asterisks) and a peptide from 
sequence "B" that spans the indei is undefined in sequence M B n . 

Sequence M A" : AAAAAAAAAAAA******************AAAAAAAAAAAA 
20 Sequence "B": BBBBBB BBBBBBBBBBBBBBBBBBBBBBBBBBBBBB BBBBBB 

The peptide corresponding to the underlined segment in sequence "B" is 30 residues long and within 6 residues of 
each end of the indeL Hence a longer peptide is included in the invention because of its proximity to the indel by 
spanning afl of or a portion of the indel. For indels that are particularly long, any peptide that has proximity to the 
indei, including those completely located within the sequence that corresponds to the indel are within the scope of 
25 this invention. For example, using the Sequences "A" and "B" above, a peptide located within the indei would be 
a series of residues from "B" sequence that corresponds to the sequence indicated by the series of asterisks in the 
"A" sequence. 

Selection of the indel proximal peptide may be baseo on peptide characteristics including length, solubility 
and ease of synthesis or purification, all of which are routine determinations for those skilled in the art of peptide 

30 synthesis. The method of delivery of the peptide (e.g., tn an aqueous solution or emulsion) may determine the 
particular indel-proximal peptide sequence selected. Preferably, for any particular indel, an overlapping set of peptides 
proximal to that indel is selected and each is tested for its ability to alter the activity of the target protein in a 
variety of assays. Thus, an effective indel-proximal peptide may span the indel site or flank the indel site. Peptides 
that flank an indel site may occur proximal to either the amino or carboxyl terminus of the indel and may have at 

35 least one residue within about one. five, ten, fifteen, twenty or thirty amino acid residues of one terminus of the 
indel. 
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Because this is a genera) method useful for identifying peptides that affect or mimic protein-protein 
interactions, it is useful for any family of proteins for which amino acid sequences are known. The amino acid 
sequences may be known for an entire protein, or only a portion of a protein, within a family of proteins and may 
be predicted based on the nucleic acid seauence coding for a protein. Thus for any family of proteins having amino 
5 acid sequences known for at least two members of the family, the method is useful for identifying peptides that 
modulate or mimic protein activities. 

The general principles of the present invention may be more fully appreciated by reference to the following 
noniimiting examples which demonstrate the utility of the method for identifying inhibitory peptides for the 
complement protein C3/C4/C5 and C2/B families. 

10 EXAMPLE 1: Identification of indelproximal sites can be used to riesion specific peptide inhibitors . 

An alignment of the C3/C4/C5 family of complement proteins using the CLUSTAL W program revealed over 
30 indeis as shown in FIG. 2A-2E. Indels were somewhat arbitrarily defined here as toci where one member of the 
family had an insertion or deletion equal to or greater than two residues in length. However, it will be understood 
that other definitions are also predictive of inhibitory peptides based on this general procedure of identification, 

15 including where one member of the family has an insertion or deletion equal to or greater than one amino acid residue 
in length. In FIG. 2A-2E, the indels are identified by a solid line under the last line of protein sequence for each 
section of six sequences and labelled with consecutive numbers. The amino acid sequences of the peptides that were 
synthesized on the basis of their proximity to these indels are underlined in the human C3, C4 or C5 amino acid 
sequences of FIG. 2A-2E and listed in Table 3 with their corresponding SEQ ID NO. 

20 Previously reported interactive sites in C3, C4. and C5 (shown as lower case letters for the amino acid 

residues in FIG. 2A-2E) were also found to cluster around the indels (e.g., see those listed in Table 1). These 
putative interactive sites have been identified previously by other methods as binding sites to complement control 
proteins, receptors, associating subunits in the complex convertases, and numerous cleavage sites by activating and 
inactivating proteases. 

25 Only a few notable interactive sites were not near indeis. These include the thioester site and the 

proteolytic site that occurs eight residues upstream of the thioester in the C3 a chain that generates the C3d 
fragment from C3dg, which may not be physiologically important (Law, SJCA., 19B8, 1 Ceil Sc L SuppL 9:57-97; 
DeBruijn. M.H.L & Fey, G.H., 1985, Proc. Natl. Acad. ScL USA 82:708712). Another is the isotype-specific site 
in C4, which may be involved in intramolecular interactions (Law, S.K.A. et at., 1984, EMBD J. 3:1819-1823; 

30 Isenman, D.E. & Young, J.R., 1984, J. Immunol 132:3019-3027; Yu f C.Y. et aL f 1986, EMBO J. 5: 2873-2881). 
Thus, the majority of active sites in the C3, C4 and C5 complement family were proximal to indels identified by the 
present method. 

For use in the following examples, enzymes and antisera can be purified using well known techniques. Most 
complement reagents are commercially available (e.g., from Advanced Research Technologies, San Diego, CA). Other 
35 commercial sources include: human CIs (Enzyme research Labs. Inc.. South Bend, IN), human factor B (Calbiochem, 
La Joila, CA), cobra venom factors (Oiamedix. Miami, FL or Quidel. San Diego, CA), human factor D, (Quidel, San 
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Diego, CA), and goat antisera against murine C3 (Organon TeknikaCapel, West Chester, PA) ano numan C3 and C5 
(Quidel, San Oiego, CA). Synthetic peptides may be obtained from Chiron Mimotopes (San Diego, CA). 
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Table 3. Inhibitory activity of C3/C4/C5 peptides in hemolytic assays. 
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5 


54 


111 1 1 


24 


KAFSDRNTLIIYLD NH 2 


0.44 


25 




55 


114 


11 


AcEVVADSVWVDVKDS-NH 2 


0.33 


50 




53 


lll-BC 


23 


Ac-SILDISMMTGFAPDT-NH, 


0.63 


60 




56 


115 


13/14 


AcSEFPESWLWNVEDLNH, 


0.59 


100 




57 


A15 


27/28 


Ac-LSSDFWGEKPNLS 


0.39 


200 


10 


58 


A12 


9 


VNFlLRMDRAHEAK-NHj 


0.24 


200 




59 


111 


1 


Ac-AQGDVPV7VTVHD-NHj 


0.37 


250 




60 


A14 


11 


Ac-SGQREVVAOSVVWDV 


0.34 


250 




61 


1-1 


2 


T1PANREFKSEKGR 


•0.07 


300 




62 


I-4 5 


8 


SITVRTKKQELSEA 


0.07 


300 


15 


63 


1-8 


14 


DLKEPPKNGISTKL 


0.13 


300 




64 


11*3 


7 


Ac-GDGVAKLSINTHPS-NHj 


0.26 


>300 




51 


II 8A S 


23 


Ac-OATMSILDISMMTG-NH, 


0.59 


>3O0 




65 


Ml 


16 


Ac-ERLGREGVQKEDI 


•0.09 


375 




66 


1-6 


11 


YYTLIGASGQREVV 


0.47 


400 


2D 


67 


111-12 


7 


Ac-DGSPAYRVPVAVQGE-NH 2 


0.27 


400 




52 


I1I-8B 


23 


AcDQDATMSILDISMM-NH; 


0.50 


>400 




DO 


1*5 


10 


RLLKAGRQVREPGQ 


0.04 


450 




69 


11-6 


15 


PKSSLSVPYVIVPMH, 


0.71 


>600 




70 


11-7 


20 


AcQVNSLPGSITKAGD-NH, 


0.24 


>600 


25 


71 


IIM 


2 


TVLTPATNHMGNVT-NHj 


0.45 


>600 




72 


C4-B1 


2 


Ac-EVQLVAHSPWLKDS 


0.47 


150 




73 


C5D5 


7 


AcTSOtDPSKSVTRVD 


0.07 


200 




74 


C5D2 


11 


AcTAELVSOSVWLNIE 


0.55 


• 



1 Indel(s) proximal to the peptide sequence in the intact protein. 

2 Unless otherwise indicated by "C4" or "C5" in the peptide name, the peptides have C3 protein sequences. 
Acetylated (Ac ) amino termini and amidated (*NH 2 ) carboxyl termini are as noted. 

5 3 Hydropathy of the peptide indicated using a scale where values about 0 indicate very hydrophilic, and 
values about 1 indicate very hydrophobic. Hydropathies were calculated as the mean of the amino acid 
parameters described by Fauchere, J.L et aL t 1983, Ear. J. Med. Chem. 10:369. 
4 Peptide concentration (//M) giving 50% inhibition of hemolytic activity in about 0.15% human serum, which 
gives about 30% hemolysis of input target EA in the absence of peptide. 

10 5 Peptide alone causes hemolysis. 

Enhances hemolytic activity 3-fold, with 50% efi .ct (2-fold increase) at about 80 fM. 
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EXAMPLE 2: Synthetic Peptides Predicted by Indel Analysis 

The peptides listed in Table 3, representing tndei-proximal sequences in the C3, C4 and C5 proteins, were 
tested for inhibition of complement function. The peptides were synthesizea using well-known methods (Geysen, H.M. 
5 et aL P 1984 f hot. Natl Acad. Sti USA 81:3998; Geysen, H.WL et ai., 1983, J. Immunol. Meth. 102: 259). 

This group of indel-proximal peptides for the C3, C4 and C5 family of proteins are underlined in FIG. 2A-2E. 
The names of the individual peptides (e.g., II- 1) used in Table 3 correspond to the names over the underlined peptide 
sequences in FIG. 2A-2E. Some of the peptides were acetylated at the amino terminus (indicated by "Ac-" in Table 
3) or were amidated at the carboxyi terminus (indicated by n -NH 2 in Table 3). The results presented in column 6 
10 of Table 3 are discussed in detail below at Example 3. 

Although these peptides were synthesized using specific chemical methods, it will be appreciated by those 
skilled in the art that other methods of producing the peptides would be equivalent. That is, other chemical 
procedures may be used to make peptides. Moreover, peptides may be produced by degradation of larger proteins 
containing the peptide sequence followed by purification of the peptide by well known methods. Recombinant genetic 
15 techniques may be used to express a DNA sequence coding for a peptide in either microbial or mammalian celts 
f oDowed by purification of the peptide. Therefore, peptides selected on the basis of the general indel method could 
be produced by a variety of procedures well known to those skilled in the art and the invention should not be 
considered limited to any one particular method of making the peptides so identified. 

AO of the synthesized peptides were tested for their ability to inhibit hi vitro hemolysis of sensitized 
20 erythrocytes by human complement. 

EXAMPLE 3: Hemolysis of Erythrocytes 

Peptides identified in Example 2 were tested for inhibition of complement activity using the hemolytic assay 
for complement activity. The peptides were added to the hemolytic assay and their ability to inhibit lysis of 
erythrocytes or red blood cells (RBC) by complement activity was measured. 
25 In this assay, sheep RBC coated with rabbit anti-sheep RBC antibodies were exposed to dilute human serum. 

Complement is activated on erythrocyte surfaces by the rabbit antibody resulting in lysis of the red blood cells. The 
amount of complement in the serum is determined by the extent of erythrocyte lysis. If peptides inhibit any step 
in the lytic pathway, this inhibition reduces the apparent complement activity of the serum resulting in less 
erythrocyte lysis. 

30 Erythrocyte lysis or hemolytic assays were performed using the "EZ Complement" kit (Diamedix. Miami, FU. 

To 0.5 ml aiiquots of sensitized erythrocytes were added 3 a/1 of a 1:4 dilution of a standard human serum sample, 
a control human serum sample, the peptide of interest, and the control sample plus the peptide. The latter two 
samples measured the hemolytic activity of the peptide alone and the inhibitory effect of the peptide on complement 
lysis, respectively. The level of hemolysis was measured as the absorbance of the solution at 415 nm after 

35 centrifugatton of the sample to remove intact erythrocytes. The level of hemolysis was expressed as the fraction 
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o< hemolysis of the control sample alone, and subtracting trie hemolysis induced by the peptide aione, without added 
control. 

FIG. 9 shows the hemolysis inhibition results tor C3 peptides 114, 11-5. III-8C and 111-11. The lytic activity 
of the peptide alone was negligible for these peptides. The data for the peptides (114, 11-5, III-8C and 111-11) shown 
in FIG. 9 show that increasing concentrations of peptide resulted in decreasing complement activity. The 
concentration of peptide necessary for 50% inhibition differed for each peptide, but was as low as about 25 fjM 
for peptide IN 1 1. Therefore, the hemolysis data indicated that some of the C3 peptides inhibited complement 
function at concentrations consistent with those seen for other inhibitory interface peptides (Dutia, 6.M. et al., 1986, 
Nature 321: 439441: Haigh, A. et al., 1990, Nature 344: 257-259; Babe, LM. et al., 1992, Protein ScL 1: 1244- 
1253). 

The results in FIG. 9 were selected to show the range of responses found among the most strongly 
inhibiting peptides. Results for all the peptides are listed in Table 3, which presents the approximate peptide 
concentration at which 50% inhibition of erythrocyte lysis (IH 50 ) was seen. These results are shown graphically in 
FIG. 10. 

The hemolytic results suggest that specific peptide-protein interactions interfere with the specific protein- 
protein interactions that are necessary for complement function. Inhibition of hemolysis was dose-dependent, and 
the concentration of peptide necessary for 50% inhibition varied substantially with each peptide (from about 25 //M 
to greater than about 600 //M). These results show that some peptides inhibit the complement system specifically. 
Moreover, the range of peptide concentrations needed for inhibition is comparable to that seen in studies of other 
inhibitory peptides in which the concentration of peptide necessary for inhibiting a protein-protein interaction was 
about 100 A/M. This result is especially true for inhibition by peptides III- 11, 114, III8C, and 11-5. 

The mechanism(s) by which the peptides inhibit hemolytic activity of complement is unclear. Inhibition by 
C3 peptides may result from inhibition of C3 interaction with one or several other proteins of the complement 
cascade such as the interaction of substrate C3 with C4b or C2a in the classical pathway convertase. That ts, 
peptides may block formation of the complex with C4b and C2a after cleavage by the C3 convertase, and/or they 
may block recognition of C5 by the C4bC3bC2a complex. Any of these effects results in loss of convertase function 
and inhibition of the complement pathway. 

Table 3 includes data for a single human C4 peptide (peptide C4-B1) and two human C5 peptides (peptides 
C5-D5 and C5D2, respectively), that were also tested for inhibition of hemolytic activity. A total of seven C5 
peptides have been examined to date. Peptide C4-B1, the only C4 peptide chosen for study, shows moderate 
inhibitory activity. Thus, in a single attempt, the indel-proximal method of peptide identification successfully located 
an inhibitory peptide in a second member of the C3/C4/C5 family. Peptide C5D5 shows the greatest inhibitory 
activity of the C5 peptides tested to date although its activity is relatively weak. In contrast, peptide C5 02 shows 
strong potentiating activity in the hemolytic assay, possibly due to inhibition of C5 interaction with a natural inhibitor 
protein in human serum. Based on the results obtained with peptides C5-D5 and C5-02, the disclosed method can 
be used to identify peptides that have potentiating as well as attenuating effects in a complex biochemical system. 
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The inhibitory peptides did not show any pattern of favoreo orientation with regard to indels. That is, 
inhibitory peptides spanned and flanked both the N- and C- termini of indels and came from indels where there were 
deletions and insertions in the C3 sequence relative to tne C4 and/or C5 sequences. Moreover, there was no 
preferred amino acid composition. The peptide length chosen arbitrarily in these examples was 13 to 15 amino acid 
5 residues. 

In the next example, C2 peptides identified by the indei method were synthesized and tested for inhibition 
of complement activity as shown by the data in Table 4. 
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Table 4. Inhibitory activity of C2 peptides in Hemolytic assays. 
SEQ ID NO: Name Sequence 1 Hvdrop 2 IH, 



75 El Ac-MRLLGMETMAWQE 0.57 30 

76 E2 REILNINQKRNDYm -0.01 >300 
77 E4 WRVNVGOPKSQWGK-NH, 0^ >30D 

1 The single-letter amino acid code is used with Acetylated (Ac-) amino termini and amidated {-NH 2 ) carboxy 
termini. 

2 Hydropathy of the peptide indicated using a scale where values about 0 indicate very hydrophilic, and 
values about 1 indicate very hydrophobic. Hydropathies were calculated as the mean of the amino acid 
parameters described by Fauchere, J.L et al.. 1983. Eur. J. Med. Chem. 10:369. 

3 Peptide concentration (a/M) giving 50% inhibition of hemolytic activity (CH^) in about 0.15% human serum, 
which gives about 30% hemolysis of input target EA in the absence of peptide. 
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EXAMPLE 4: Peptide inhibitors from the C2/factorB complement protein family designed by indel association. 

To test further the effectiveness of the indel-proximal method, the C2ifactor B family of complement 
proteins was analyzed. C2 and factor B are structurally similar proteins (about 100,000 m.w.) that play analogous 
roles in the classical and alternative pathways, respectively, of complement activation and whose genes arose from 
a common ancestor (Campbell, R.D. et al., 1988, Ann. Rev. Immunol. 6:161-195). Using procedures essentially as 
described in Example 1. members of the C2/factor B complement protein family were aligned as shown in FIG. 11 A- 
11B. indels were identified and three indel-oroximai peptides were selected for testing (underlined in FIG. 11A-11B). 
As seen by the data in Table 4, two peptides, E2 and E4 (SEQ ID N0:76 and SEQ ID NO:77, respectively), showed 
no evidence of inhibitory activity in the hemolytic assay. Peptide El (SEO ID N0:75), however, showed very potent 
activity, as shown in FIG. 12 which presents inhibition of complement hemolytic activity by peptide El in the 
presence and absence of serum in the assay. About 30 pfA of peptide El resulted in 50% inhibition, which is 
essentially equal to the most potent C3 inhibitor, peptide 111-11 (SEQ ID N0:54). 

Hence the indel-proximal method of peptide identification has been successfully used to guide the design 
of peptide inhibitors for another protein family in the complement system Dy efficiently identifying protein interactive 
regions and inhibitory peptides which are presumably interface peptides. 
EXAMPLE 5: Generation of Antibodies Against Indelassociated Peptides 

Antibodies that recognize the inhibitory peptides identified by the indel method may also be useful reagents 
for inhibiting protein activity. Such antibodies would be expected to bind to the indel-proximal portion of the native 
protein, thus blocking activity associated with that portion of the protein. 

Polyclonal and monoclonal antibodies can be generated against the purified peptides designed by using the 
indel method. Monoclonal antibodies useful in the present invention can be produced and isolated by processes 
which are well known in the art. such as those discussed by Milstein and Kohler [Nature 256:495497, 1975). 
Monoclonal antibodies may be more useful because of the specificity associated with such antibodies and the ability 
to produce antibodies with the same specificity from a clonal cell line. Peptides designed by using the indel method 
that have inhibitory activity in an in vitro or in vivo assay are synthesized and monoclonal antibodies are then 
generated using the purified peptides by standard procedures. 

In accordance with one known process for preparing monoclonal antibodies, mice such as Balb/c femaie mice 
or other mouse strains or even other suitable animals (e.g., rats or rabbits) are immunized with an amount of a 
peptide of interest, such as those identified in Examples 3 and 4, to initiate an immune response. The animals are 
immunized with the peptides mixed with a suitable adjuvant or with peptides conjugated to a carrier molecule. The 
peptide dosage and immunization schedule for producing useful quantities of suitable splenocytes can be readily 
determined by one skilled in the art depending on the animal strain used. 

The size and spacing of doses of peptide are generally microgram quantities with a minimum dosage for 
initiating an immune response typically in the range of 0.1-100 /yg/animal. For example, an initial immunization with 
approximately 50 j/g of peptide may be followed by a series of five injections for hyperimmuncation. An adfuvant 
(e.g.. Freund's incomplete and complete adjuvants and alum gels) may be mixed with the peptide antigen to ennance 
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antibody production against the antigen using methods well known in the art. Thus, a given dose of peptide may 
be more effective when injected subcutaneousiy with an adjuvant or when injected as repeated small aliquots than 
when administered intravenously. 

Following the primary immunization with the peptide antigen, the animal is monitored for production of anti 
5 peptide specific serum antibodies using well known techniques (e.g., ELISA or radioimmunoassay) normally about one 
to two weeks after immunization. After the primary response to the peptide antigen is detected, a second dose of 
the same antigen is given to elicit a peptidespecific secondary immune response which is also detected by standard 
immunoassays. After one to five booster immunizations, the animal is killed and its spleen cells are isolated and 
fused with myeloma cells (e.g., the murine cell line Sp2/0-Ag14) to produce hybridoma cell lines capable of 

10 reproduction in vitro to produce anti-peptide antibodies. 

The myeloma cell line selected should be compatible with the spleen cells, and optimally should be a ceil 
fine of the same species as the spleen cells. Although the murine cell line Sp2/0-Ag14 has been found to be 
effective for use with mouse spleen cells, other myeloma cell lines can alternatively be used. See, for example. 
Nature, 276: 269-270 (1978) and U.S. Patent No. 5.472,868, for fusion partner cells and methods of using such 

15 cells to produce hybridomas. 

Spleen cells are fused with an appropriate myeloma cell line using polyethylene giycol. After fusion of 
spleen and myeloma cells, the mixture of unfused spleen cells, unfused myeloma ceils and fused cells are diluted and 
cultured in a selective medium (e.g., containing hypoxanthine, aminopterin and thymidine) that supports growth of 
fused cells and will not support the growth of the unfused myeloma cells for a time sufficient to allow death of all 

20 unfused cells. Since the unfused spleen cells are nonmalignant, they have only a finite number of generations until 
they fail to reproduce. The fused cells reproduce because they possess the malignant quality contributed by the 
myeloma parent and the enzyme necessary to survive in the selected medium contributed by the spleen cell parent 
The supernatant from each of a plurality of hybridoma-containing tissue culture wefls is evaluated for the 
presence of antibody specific to the peptide of interest using any of a number of well known immunoassays. 

25 Hybridomas that produce antibodies that specifically recognize the peptide used as an immunogen are cloned (e.g., 
by limiting dilution) for subsequent production of antibodies in vitro or in vivo. Anti-peptide antibodies are produced 
either in vitro by tissue culture of the selected cell lines or in vivo by generating ascites fluid in mice injected with 
the hybridoma cell line. The monoclonal antibody may then be isolated in accordance with techniques known in the 
art. 

30 Purified Anti-peptide antibodies are identified as those capable of modulating a biological system, such as 

the complement system, using the appropriate in vitro or m vivo assay for the target protein or protein system. For 
example, the monoclonal antibodies are tested for their ability to enhance or inhibit hemolysis in the assay as 
described in Example 3. The anti-peptide antibody is added into the hemolysis assay alone or in combination with 
the corresponding peptide against which the antibody was raised. To further test antibodies identified as having a 

35 biological activity, they are tested for their ability to mediate a therapeutic effect as in the in vivo assay described 
in Example 6. That is, the antipeptide antibody is injected before, simultaneous or after injection of the 
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corresponding peptide or instead of the peptide during the antibody challenge and modulation of the inflammation 

response is monitored relative to the appropriate control reactions, 

EXAMPLE 6: In Vivo Assay to Measure Peptide Inhibition of Complement-mediated Inflammation 

The Arthus model of complement-mediated inflammation is caused by interaction of tissue antigen with 
5 circulating antibody. This in vivo model of complement activation is characterized by formation of immune complexes 
resulting in complement activation, inflammatory ceil recruitment, edema and tissue damage (BaSey, P. & Sturm. A., 
1983, Biochem. Pharm. 32: 475). A passive Arthus reaction is established in an animal (e.g., a guinea pig) by first 
injecting an antigen i.v. and then challenging with an antigen-specific antibody. Peptide inhibition of the complement- 
mediated response is measured when peptide is injected Ld. before or simultaneous with antibody challenge and then 

10 biopsy tissue taken from the antibody challenge site is assayed for inflammation. 

Male guinea pigs (about 300 g) are anesthetized by injection of sodium pentobarbital (40 mg/kg, i.p.) and 
then injected i.v. with ovalbumin (20 mg/kg) and 125 i labeled bovine serum albumin ( 125 I BSA, 1 //Ci). Antibody 
challenge immediately follows by injecting animals in the dorsal region with polyclonal antiovalbumin antibody (10 
mg, Ld.), with or without an indel identified peptide in the iM range indicated as effective complement inhibition 

15 by the hemolysis assay (see Example 3). 

Three hours after injection of the antibody challenge, the animals are humanely killed and skin tissue from 
the antibody challenge site is excised by biopsy punch (about 2-5 mm). Inflammation is measured by leakage of "l- 
BSA into the skin tissue determined by standard radiochemical counting procedures for 125 1; percent inhibition by 
peptide is measured by comparing leakage of 12S I BSA into the skin tissue in the presence of peptide with that seen 

20 in the absence of peptide. As a positive control for complement inhibition, a separate animal is injected with cobra 
venom factor (200 U/kg, i.p.) 24 hr before initiation of the Arthus response; the venom factor results in suppression 
of the complement response in animals that receive antigen and antibody without complement-inhibitory peptide. 

Peptides identified as being complement inhibitory in the hemolysis assay show complement inhibition in this 
in vivo model in a dose-dependent manner with maximal inhibition comparable to the animal that received cobra 

25 venom factor before initiation of the Arthus response. For example, a series of 250-300 g male guinea pigs are 
anesthetized (with sodium pentobarbital, at 40 mg/kg, i.p.) and then injected Lv. with ovalbumin (20 mg/kg) and 
labeled bovine serum albumin ( 125 I-BSA, 1 /yCi). One of the animals is the cobra venom factor control which received 
cobra venom factor (200 U/kg, i.p.) at about 24 hr before injection of the ovalbumin and 125 I-BSA. The other animals 
are individually i.v. injected with 0.1 /yg, 0.5 j/g, 1 |/g, 5 m and 10 //g of peptide III-8C (SEQ ID NO:53), or no 

30 peptide, simultaneously with the ovalbumin and ,25 I BSA i.v. injection. Each animal then receives polyclonal anti- 
ovalbumin antibody (10 mg, Ld. in the dorsal region). Three hours after injection of the antiovalbumin antibody 
challenge, the animals are humanely killed and skin tissue from the antibody challenge site is excised (by biopsy 
punch, 2-5 mm). Inflammation is measured by leakage of ,25 l BSA into the skin tissue determined by standard 
radiochemical counting procedures for 12S I to determine the percentage of inhibition of inflammation mediated by the 

35 peptide III-8C dosage. The inhibition seen in the venom factor control animal serves as an arbitrary 100% inhibition 
data point whereas the data for the animal that received no peptide and no venom factor serves as an arbitrary 0% 
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inhibition data point. By comparison with the results in control animals, the animals that receive peptide III8C show 
about 5-10% inhibition with 0.1 /sg peptide, about 7-21 % inhibition with 0.5 //g, about 10-35% inhibition with 1 
pq, about 2045% inhibition with 5 /sg, and about 40-80% inhibition with 10 //g of peptide III-8C. Thus, in vivo 
activity of a peptide parallels that seen in vitro. Although the mechanism of this activity is not known, it may result 
5 from inhibition of one or more proteins of the complement system, competition with C3 protein, or binding to a 
protein or ligand such as a C3 convertase. 

EXAMPLE 7: Selection and/or Synthesis of Peotidomimetic Inhibitors Based on IndeMdenttfied Peptides 

Because peptidomimetic molecules can exhibit increased potency as inhibitory molecules for protein-protein 
interactions, peptidomimetics of an inhibitory peptide identified using the indel method and the hemolytic assay are 

10 selected and screened for increased inhibitory activity. 

Peptidomimetics of peptide I1 1 (SEQ ID N0:59) are selected from a combinatorial peptide library in which 
the internal five amino acid residues (VPVTV) are randomized (Bianchi, E. et at., 1995, J. MoL Biol. 247: 154-160) 
producing a library of peptides that are analogs of peptide IM. The library of peptides can be produced either using 
molecular genetic techniques (e.g., synthesis of the partially randomized DNA sequences coding for the peptide IM 

15 analogs followed by expression of the peptides in cells and purification of the peptides) or by chemically synthesizing 
the partially randomized peptides. Chemical synthesis allows additional expansion of the repertoire by allowing 
inclusion of non-coded amino acids or organic tsosteric replacement groups (e.g., a 4(5)-acyiimidazole ring for 
replacement of an amide bond). The peptide analogs and peptidomimetics are tested in vitro for their inhibitory 
activity relative to peptide 11-1 in a hemolytic assay essentially as described in Examples 3 and 4 and in an in vivo 

20 assay essentially as described in Example 6. Some of the partially randomized peptidomimetics and analogs of 
peptide 11-1 show increased inhibition of complement function in the hemolysis assay, some show increased inhibition 
the animal model and some show increased inhibitory activity in both assays. 

Peptide 18 (SEQ 10 N0:63) is used to synthesize a retro-O-amino acid peptide that is an isomer of the 
corresponding L-amino acid 1-8 peptide. That is, a peptide is synthesized using only D-amtno acids and having a 

25 sequence LKTSIGNKPPEKLD (SEQ ID NO:78) which is the reverse of the sequence of peptide 1-8. The 1-8 peptide 
and the retro-D-peptide of 1-8 are compared in the hemolytic assay as described in Example 3. The 1-8 shows little 
or no inhibition of hemolysis, requiring a peptide concentration of about 300 fjU to produce 50% inhibition of the 
hemolytic activity in 0.15% human serum. In contrast, the retroO-peptide of 1-8 having reversed peptide bond 
orientation of the corresponding L-amino acid peptide shows increased inhibition because it produces 50% inhibition 

30 at a concentration of about 150 /sM in the same type of reaction. 

Another peptide analog is produced as a covatently cycfized version of peptide 1*5 having the linear sequence 
CRLLKAGRQVREPGQC (SEQ ID NO:79) (i.e., addition of cysteine residues at the amino- and carboxyl termini of peptide 
15). The linear peptide having SEQ ID NO:79 is synthesized using conventional solid-phase protein synthesis and 
purified by reverse phase HPLC using standard methods. The purified peptide is then oxidized to covatently cyclize 

35 the terminal cysteine sulphydryl side chains to form an intrachain disulfide bridge (such as described by McDonnell, 
J.M. et aL, 1996. Nature Structural BiaL 3(51:419426), thus constraining the peptide structurally. Moreover, the 
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ccrresponding linear version of the peptide is used as i onuoi and ail three peptides (1-5, the linear and cyciized 
forms of SEQ ID N0:79) are compared in the hemolytic assay as described in Example 3. The 1-5 peptide shows 
limited inhibition of hemolysis, requiring a peptide concentration of about 450 //M to produce 50% inhflmion of the 
hemolytic activity in 0.15% human serum. Similarly, the linear version of the peptide having SEQ ID N0:79 also 
5 shows limited inhibition of hemolysis, requiring a peptide concentration of about 350-400 /sM to produce 50% 
inhibition of the hemolytic activity in 0.15% human serum. In contrast, the cyciized version of the peptide having 
SEQ ID NQ:79 produces no inhibition of hemolysis and enhances hemolytic activity about 2-fold at a concentration 
of about 100 fjM in the same type of reaction. Thus the physical form of the peptide having SEQ ID N0:7S 
substantially affects its ability to modulate protein activity in this assay. 

10 EXAMPLE 8: lndel-associated oeotide substitution for a target protein's receptor 

A series of peptides are generated for the sequence of complement C3 protein near indel 13 as shown in 
FIG. 2A-2E, such as peptides 11-5 (SEQ ID N0:56) and 1-8 (SEQ ID N0:63), and cyciized peptidomimetics of these 
peptides in which the amino- and carboxyltermini D-cysteines joined by a disulfide bridge are synthesized using 
techniques described earlier. Peptide 1-11 (SEQ ID NO:65), overlapping indel 16 in FIG. 2A-2E, and its corresponding 

15 cyclized-D-cysteine peptidomimetic is used as a negative control and purified C3 protein is used as a positive control 
These peptides and protein are radiolabeled using standard procedures and the labeled peptides and protein are tested 
for their ability to bind to complement receptors CR1, CR2 and CR3 using in vitro binding assays known in the art 
(Diefenbach, R.J. & Isenman, D.E., 1995, J. Immunol. 154:2303; Taniguchi-Stdle. A. & Isenman, D.L, 1994, •/. 
Immunol. 153:5285). These binding assays show that purified C3 binds to complement receptors CR1, CR2 and 

20 CR3, neither the linear peptide M 1 or its cyciized peptidomimetic bind to any of the receptors, and the linear forms 
of peptides 11-5 and 1-8 bind to all three C3 receptors but never more than about 40-50% of the efficiency of binding 
as with purified C3 protein. The cyciized peptidomimetic of peptide 11-5 binds to all three receptors but less 
efficiently than its linear form whereas the cyciized peptidomimetic of peptide 1-8 binds slightly more efficiently to 
all three receptors than the linear peptide. These results show that peptides identified by the indel-association 

25 method and peptidomimetics modeled on such peptides can substitute for a ligand or substrate of the target protein. 
EXAMPLE 9: Pharmaceutical composition including an indel-proximal identified peptide or peptide analoo or 
peotidomtmetic thereof 

Peptides, peptide analogs or peptidomimetic molecules according to the present invention are identified, 
synthesized and tested for biological activity in an m vitro and/or in vivo assay as described in the previous examples 

30 or using other well known methods and assays. For delivery of a peptide, peptide analog or peptidomimetic molecule 
(hereafter generally referred to as "peptide"), the peptide is dissolved in a buffered salt solution (e.g., phosphate 
buffered saline) at a dose of 1 /ygfml to 100 //giml depending on the activity of the peptide as determined in an in 
vivo or in vitro assay. The peptide solution is then injected in 0.5-5 ml aliquots into the mammal including a human 
to be treated via any of a variety well known routes (Lv„ Lm., s.c, etc.) depending on the type of treatment. For 

35 example, for a localized inflammatory response, a peptide that inhibits complement activity (e.g., peptide HMD is 
injected locally near the area of inflammation, with repeated dosages periodically until inflammation is sufficiently 
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decreased as determined by the treating physician. In another application, a patient experiencing ischemia-reperfusion 
following myocardial infarction may be treated with a complement-inhibiting peptide delivered by catheterization to 
defiver the peptide selectively to the affected area. 

For peptides that have been identified as being active by eliciting antibodies to the peptide which then 
5 mediate the desired anti-target protein activity, the peptide is preferably coupled to ovalbumin grade V at about a 
10:1 molar ratio using N-succinimidyl 3 {2-pyndyldithio) propionate (SPDP). Briefly, SPOP (40 mM in 99% ethanol 
is added dropwise to ovalbumin (dissolved in 0.2M NaH 2 P0«, pH 8.5) to form a SPDP-ovaibumin conjugate that is 
purified by column chromatography and coupled to peptide (1 mg/ml in 10% acetic acid) by allowing the peptide and 
SPDPovalbumin to incubate for about 12 hr at RT. The peptide-SPDP-ovalbumin conjugate (100 //g/0.5 ml of 

10 phosphate buffered saline, with or without an adjuvant) is injected i.m. in a series of three injections about one week 
apart to induce an anti-peptide immune response in the mammal receiving the peptide. 

Although the present invention has been described in the context of particular examples and preferred 
embodiments, it will be understood that the invention is not limited to such embodiments. Instead, the scope of the 
present invention shall be measured by the claims that follow and all modifications which come within the meaning 

15 and range of the lawful equivalency of the claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Lidak Pharmaceuticals 

<ii) TITLE OF THE INVENTION: METHOD FOR IDENTIFYING PEPTIDES THAT 
AFFECT PROTEIN-PROTEIN INTERACTIONS AND COMPLEMENT - MODULATING PEPTIDES 

iiii) NUMBER OF SEQUENCES : 79 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Knobbe, Martens, Olson and Bear 

(B) STREET: 620 Newport Center Blvd. 16th Floor 

(C) CITY: Newport Beach 

(D) STATE: CA~ 

(E) COUNTRY: USA 

(F) ZIP : 92660 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 
<B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Israelsen, Ned A 

(B) REGISTRATION NUMBER: 29,655 

(C) REFERENCE /DOCKET NUMBER: LIDAK. 04 8A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-235-8550 

(B) TELEFAX: 619-235-0176 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

< ii ) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) AKTISENSE: NO 

iv) FRAGMENT TYPE: internal 
ivi) ORIGINAL SOURCE: 

txi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Val Pro Val Ala Val Gin Glv Glu Aso Thr Val Gin Ser Leu Thr Gin 

1 5 * 10 15 

Gly Asp Gly 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Val Leu Val Val Thr Gin Gly Ser Asn Ala Lys Ala Leu Thr Gin Asp 

15 10 15 

Asp Gly 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 ammo acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

lie Pro Val Lys Val Ser Ala Thr Val Ser Ser Pro Gly Ser Val Pro 

15 10 15 

Glu Ala Gin Asp lie Gin Gin Asn Thr Asp Gly 
20 25 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Val Pro Val Lys Val Ser Ala Thr Leu Val Ser Gly Ser Asp Ser Gin 

15 10 15 

Val Leu Asp He Gin Gin Ser Thr Asn Gly 
20 25 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Val Pro Val lie Leu Asn Ala Gin Thr He Asp Val Asn Gin Glu Thr 

15 10 15 

Ser Asd Leu Asp Pro Ser Lys Ser Val Thr Arg Val Asp Asp Gly 
20 25 30 

(2) INFORMATION FOR SEC ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Val Pro Val Thr Leu Met Ala Gin Thr Val Asp Val Asn Gin Glu Thr 

15 10 15 

Ser Asp Leu Glu Thr Lys Arg Ser He Thr His Asp Thr Asp Gly 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1663 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : I inear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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Val 






375 




Val 


Ala 


Val 


Gin 




390 






Gly 


Val 


Ala 


Lys 


405 








Ser 


He 


Thr 


Val 


Ala 


Thr 


Arg 


Thr 








440 


Asn 


Asn 


Tyr 


Leu 






455 




Glu 


Thr 


Leu 


Asn 




470 






Ala 


Lys 


He 


Arg 


485 








Leu 


Lys 


Ala Gly 


Leu 


Pro 


Leu 


Ser 








520 


Ala 


Tyr 


Tyr 


Thr 






535 




Asp 


Ser 


Val 


Trp 




550 






Val 


Lys 


Ser Gly 


565 








Met 


Thr 


Leu 


Lys 


Ala 


Val 


Asp 


Lys 








600 


Gin 


Ser 


Lys 


He 






615 




Pro 


Gly 


Ser Gly 




630 






Thr 


Phe 


Thr 


Ser 


645 








Gin 


Cys 


Pro 


Gin 


Glu 


Lys 


Arg 


Met 








680 


Lys 


Asp 


Ser 


He 






695 




Lys 


Lys 


Gly 


lie 




710 
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Asn Pro Glu Gly 
170 . 

Gin Leu Gly Val 
185 

Met Gly Gin Trp 

Val Phe Ser Thr 
220 

Glu Val He Val 
235 

Lys Gly Leu Glu 
250 

Val Glu Gly Thr 
265 

Arg He Ser Leu 

Ser Gly Glu Val 
300 

Asn Leu Arg Ala 
315 

Thr Val He Leu 
330 

Gly He Pro He 
345 

Pro Lys Tyr Phe 

Thr Asn Pro Asp 
380 

Gly Glu Asp Thr 
395 

Leu Ser He Asn 
410 

Arg Thr Lys Lys 
425 

Met Gin Ala Leu 

His Leu Ser Val 
460 

Val Asn Phe Leu 
475 

Tyr Tyr Thr Tyr 
490 

Arg Gin Val Arg 
505 

He Thr Thr Asp 

Leu He Gly Ala 
540 

Val Asp Val Lys 
555 

Gin Ser Glu Asp 
570 

He Glu Gly Asp 
585 

Gly Val Phe Val 

Tro Asp Val Val 
620 

Lvs Asp Tyr Ala 
635 

Ser Ser Gly Gin 
650 

Pro Ala Ala Arg 
665 

Asp Lys He Ser 

Thr Thr Trp Glu 
700 

Cys Val Ala Asp 
715 



He Pro Val Lys 
175 

Leu Pro Leu Ser 
190 

Lys lie Arg Ala 
205 

Glu Phe Glu Val 

Glu Pro Thr Glu 
240 

Val Thr He Thr 
255 

Ala Phe Val He 
270 

Pro Glu Ser Leu 
285 

Val Leu Ser Arg 

Glu Asp Leu Val 
320 

His Ser Gly Ser 
335 

Val Thr Ser Pro 
350 

Lys Pro Gly Met 
365 

Gly Ser Pro Ala 

Val Gin Ser Leu 
400 

Thr His Pro Ser 
415 

Gin Glu Leu Ser 
430 

Pro Tyr Ser Thr 
445 

Leu Arg Thr Glu 

Leu Arg Met Asp 
480 

Leu He Met Asn 
495 

Glu Pro Gly Gin 
510 

Phe lie Pro Ser 
525 

Ser Gly Gin Arg 

Asp Ser Cys Val 
560 

Arg Gin Pro Val 
575 

His Gly Ala Arg 
590 

Leu Asn Lys Lys 
605 

Glu Lys Ala Asp 

Gly Val Phe Ser 
640 

Gin Thr Ala Gin 
655 

Arg Arg Arg Ser 
670 

Thr Lys Leu Met 
685 

He Leu Ala Val 

Pro Phe Glu Val 
720 
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"Thr Val 


Met Gin Asp Phe 


Phe 


He 


Asp 


Leu Arg 


Leu Pro 


Tyr 


Ser Val 






725 








730 






735 


Val 


Arg 


Asn Glu Gin Val 


Glu 


He Arg 


Ala Val 


Leu Tyr Asn Tyr Arg 






740 






74 D 






750 




Gin 


Asn 


Gin Glu Leu Lys 


Val 


Arg 


Val 


Glu Leu 


Leu His 


Asn 


Pro Ala 






755 




760 






765 






Phe 


Cys 


Ser Leu Ala Thr 


Thr 


Lys 


Arg 


Arg His 


Gin Gin 


Thr 


Val Thr 




770 




775 








780 






lie 


Pro 


Pro Lys Ser Ser 


Leu 


Ser 


Val 


Pro Tyr 


Val He 


Val 


Pro Leu 


785 




790 








795 






800 


Lys 


Thr 


Gly Leu Val Gly 


Lys 


Tyr 


Pro 


Lys Glu 


Leu Arg 


Lys 


Cys Cys 






805 








810 






815 


Glu 


Asp 


Gly Met Arg Glu 


Asn 


Pro 


Met 


Arg Phe 


Ser Cys 


Gin Arg Arg 






820 






825 






830 




Thr 


Arg 


Phe He Ser Leu 


Gly 


Glu 


Ala 


Cys Lys 


Lys Val 


Phe 


Leu Asp 







835 




840 






845 






Cys 


Cys 


Asn Tyr He Thr 


Glu 


Leu 


Arg 


Arg Gin 


His Ala 


Arg 


Ala Ser 




850 




855 








860 






His 


Leu 


Gly Leu Ala Arg 


Ser 


Asn 


Leu 


Asp Glu 


Asp He 


He 


Ala Glu 


865 




870 








875 






880 


Glu 


Asn 


He Val Ser Arg 


Ser 


Glu 


Phe 


Pro Glu 


Ser Trp 


Leu 


Trp Asn 






885 








690 






895 


Val 


Glu 


Asp Leu Lys Glu 


Pro 


Pro 


Lys 


Asn Gly 


Gin Glu 


Val 


Glu Val 






900 






905 






910 




Lys 


Ala 


Ala Val Tyr His 


His 


Phe 


He 


Ser Asp 


Gly Val 


Arg 


Lys Ser 






915 




920 






925 






Leu 


Lys 


Val Val Pro Glu 


Gly 


He 


Arg 


Met Asn 


Lys Thr 


Val 


Ala Val 




930 




935 








940 






Arg 


Thr 


Leu Asp Pro Glu 


Arg 


Leu Gly 


Arg Glu 


Gly Val 


Gin 


Lys Glu 


945 




950 








955 






960 


Asp 


He 


Pro Pro Ala Asp 


Leu 


Ser Asp 


Gin Val 


Pro Asp 


Thr 


Glu Ser 






965 








970 






975 


Glu 


Thr 


Arg He Leu Leu 


Gin 


Gly Thr 


Pro Val 


Ala Gin 


Met 


Thr Glu 






980 






985 






990 




Asp 


Ala 


Val Asp Ala Glu 


Arg 


Leu 


Lys 


His Leu 


He Val 


Thr 


Pro Ser 






995 


1000 






1005 






Gly 


Cys 


Gly Glu Gin Asn 


Met 


He 


Gly 


Met Thr 


Pro Thr 


Val 


He Ala 


1010 


1015 






1020 . 






Val 


His 


Tyr Leu Asp Glu 


Thr 


Glu 


Gin 


Trp Glu Lys Phe Gly Leu Glu 


025 




1030 








1035 






1040 


Lys 


Arg 


Gin Gly Ala Leu Glu 


Leu 


He 


Lys Lys 


Gly Tyr Thr Gin Gin 






1045 






1050 




1055 


Leu 


Ala 


Phe Arg Gin Pro 


Ser 


Ser 


Ala 


Phe Ala 


Ala Phe 


Val 


Lys Arg 






1060 




1065 




1070 




Ala 


Pro 


Ser Thr Trp Leu 


Thr 


Ala 


Tyr 


Val Val 


Lys Val 


Phe 


Ser Leu 




1075 


1080 






1085 






Ala 


Val 


Asn Leu He Ala 


He 


Asp 


Ser 


Gin Val 


Leu Cys Gly Ala Val 


1090 


1095 






1100 






Lys 


Trp Leu He Leu Glu 


Lys 


Gin Lys 


Pro Asp Gly Val 


Phe 


Gin Glu 


105 




1110 








1115 






1120 


Asp 


Ala 


Pro Val He His 


Gin 


Glu 


Met 


lie Gly Gly Leu Arg Asn Asn 



1125 1130 1135 



Asn Glu Lys Asp Met Ala Leu Thr Ala Phe Val Leu He Ser Leu Gin 

1140 1145 1150 

Glu Ala Lys Asp He Cys Glu Glu Gin Val Asn Ser Leu Pro Gly Ser 

1155 1160 1165 

He Thr Lys Ala Gly Asp Phe Leu Glu Ala Asn Tyr Met Asn Leu Gin 

1170 1175 1180 

Arg Ser Tyr Thr Val Ala lie Ala Gly Tyr Ala Leu Ala Gin Met Gly 
185 1190 1195 1200 

Arg Leu Lys Gly Pro Leu Leu Asn Lys Phe Leu Thr Thr Ala Lys Asp 

1205 1210 1215 

Lys Asn Arg Trp Glu Asp Pro Gly Lys Gin Leu Tyr Asn Val Glu Ala 

1220 1225 1230 

Thr Ser Tyr Ala Leu Leu Ala Leu Leu Gin Leu Lys Asp Phe Asp Phe 

1235 1240 1245 

Val Pro Pro Val Val Arg Trp Leu Asn Glu Gin Arg Tyr Tyr Gly Gly 

1250 1255 1260 

Gly Tvr Gly Ser Thr Gin Ala Thr Phe Met Val Phe Gin Ala Leu Ala 
265 1270 1275 1280 
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Gln Tyr Gin Lys Asp Ala Pro Asp His Gin Glu Leu Asn Leu Asp Val 

1285 1290 1295 

Ser Leu Gin Leu Pro Ser Arg Ser Ser Lys lie Thr His Arg lie His 

1300 1305 1310 

Trp Glu Ser Ala Ser Leu Leu Arg Ser Glu Glu Thr Lys Glu Asn Glu 

1315 1320 1325 

Gly Phe Thr Val Thr Ala Glu Gly Lys Glv Gin Gly Thr Leu Ser Val 

1330 1335 " 1340 

Val Thr Met Tyr His Ala Lys Ala Lys Asd Gin Leu Thr Cys Asn Lys 
345 1350 "l355 1360 

Phe Asp Leu Lys Val Thr lie Lys Pro Ala Pro Glu Thr Glu Lys Arg 

1365 1370 1375 

Pro Gin Asp Ala Lys Asn Thr Met He Leu Glu He Cys Thr Arg Tyr 

1380 1385 1390 

Arg Gly Asp Gin Asp Ala Thr Met Ser He Leu Asp He Ser Met Met 

1395 1400 1405 

Thr Gly Phe Ala Pro Asp Thr Asp Asp Leu Lys Gin Leu Ala Asn Gly 

1410 1415 1420 

Val Asp Arg Tyr He Ser Lys Tyr Glu Leu Asp Lys Ala Phe Ser Asp 
425 1430 1435 1440 

Arg Asn Thr Leu lie He Tyr Leu Asp Lys Val Ser His Ser Glu Asp 

1445 1450 1455 

Asp Cys Leu Ala Phe Lys Val His Gin Tyr Phe Asn Val Glu Leu He 

1460 1465 1470 

Gin Pro Gly Ala Val Lys Val Tyr Ala Tyr Tyr Asn Leu Glu Glu Ser 

1475 1480 1485 

Cys Thr Arg Phe Tyr His Pro Glu Lys Glu Asp Gly Lys Leu Asn Lys 

1490 1495 1500 

Leu Cys Arg Asp Glu Leu Cys Arg Cys Ala Glu Glu Asn Cys Phe He 
505 1510 1515 1520 

Gin Lys Ser Asp Asp Lys Val Thr Leu Glu Glu Arg Leu Asp Lys Ala 

1525 1530 1535 

Cys Glu Pro Gly Val Asp Tyr Val Tyr Lys Thr Arg Leu Val Lys Val 

1540 1545 1550 

Gin Leu Ser Asn Asp Phe Asp Glu Tyr He Met Ala lie Glu Gin Thr 

1555 1560 1565 

He Lys Ser Gly Ser Asp Glu Val Gin Val Gly Gin Gin Arg Thr Phe 

1570 1575 1580 

He Ser Pro lie Lys Cys Arg Glu Ala Leu Lys Leu Glu Glu Lys Lys 
585 1590 1595 1600 

His Tyr Leu Met Trp Gly Leu Ser Ser Asp Phe Trp Gly Glu Lys Pro 

1605 1610 1615 

Asn Leu Ser Tyr lie lie Gly Lys Asp Thr Trp Val Glu His Trp Pro 

1620 1625 1630 

Glu Glu Asp Glu Cys Gin Asp Glu Glu Asn Gin Lys Gin Cys Gin Asp 

1635 1640 1645 

Leu Gly Ala Phe Thr Glu Ser Met Val Val Phe Gly Cys Pro Asn 
1650 1655 1660 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1663 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met Gly Pro Ala Ser Gly Ser Gin Leu Leu Val Leu Leu Leu Leu Leu 

1 5 10 15 

Ala Ser Ser Pro Leu Ala Leu Gly He Pro Met Tyr Ser He lie Thr 

20 25 30 

Pr Asn Val Leu Arg Leu Glu Ser Glu Glu Thr lie Val Leu Glu Ala 
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3 5 




Hie 


21 err* 


Ala 


Gin 




50 






ucu 




At*CT 


Gin 


65 








Car 




His 


Leu 


ir ne 


Asn 


Cor 

OCX 


Aso 








100 


Phe 


Gly 


Glu 


Thr 




115 




Gly 




Leu 


Phe 




130 






Thr 


Vai 


Leu 


Tvr 


145 










Lys 


Thr 


Vai 


T 

Lys 


Arg 




He 








180 


Ser 


Trp 


Asn 


He 










Ala 


pne 


Tyr 


blU 










vai 


Liys 


Glu 


Tyr 


*5 *5 C 








blU 


1112? 


Phe 


Tyr 


lie 


Ala 


Lys 


Phe 








260 


lie 


Phe 


Gly Vai 






275 




Leu 


Thr 


Arg 


Vai 




290 






Arg 


Lys 


Vai 


Leu 


1 AC 

3 Ob 








vai 


Gly 


Lys 


Ser 


Ssr 


Asp 


Met 


Vai 








340 


Pro 


Tyr 


Gin 


He 




355 




neu 


Pro 


Phe 


Asp 










Aid 


Ser 


Lys 


Vai 


IOC 








l Hi 




Asp Asp 


Arg 


bin 


Pro 


Leu 








420 


Glu 


Ser 


Arg 


Gin 






435 




Met 


His 


Asn 


Ser 










Lieu 


Lys 


Pro 


Gly 


A £ C 








Pro 


Caiy 


His 


Glu 


Liys 


Caiy 


Lys 


Leu 








500 


Asp 


Leu 


Vai 


Vai 






515 




Pne 


Arg 


Leu 


Vai 




530 






Glu 


Vai 


Vai 


Ala 


545 








Gly 


Thr 


Leu 


Vai 


Gly 


Gin 


Gin 


Thr 








580 


Gly 


Leu 


Vai 


Ala 



40 

Glv Asp He Pro 
55 

Vai Leu Thr Ser 
70 

Arg Ser Vai Ser 
85* 

Lys Glu Gly His 

Vai Vai Glu Lvs 
120 

He Gin Thr Asp 
135 

Arg He Phe Thr 
150 

Vai He Leu He 
165 

Leu Ser Ser Asn 

Pro Glu Leu Vai 
200 

His Ala Pro Lys 
215 

Vai Leu Pro Ser 
230 

Tvr lie Asp Asp 
245 

Leu Tyr Gly Lys 

Gin Asp Gly Asp 
280 

Vai He Glu Asp 
295 

Met Glu Gly Vai 
310 

Leu Tyr Vai Ser 
325 

Glu Ala Glu Arg 

His Phe Thr Lys 
360 

Leu Met Vai Phe 
375 

Leu Vai Vai Thr 
390 

Gly Vai Ala Lys 
405 

Thr lie Thr Vai 

Ala Thr Lys Thr 
440 

Asn Asn Tvr Leu 
455 

Asp Asn Leu Asn 
470 

Ala Lys lie Arg 
485 

Leu Lys Ala Gly 

Leu Ser Leu Pro 
520 

Ala Tyr Tyr Thr 
535 

Asp Ser Vai Trp 
550 

Vai Lys Gly Asp 
565 

Thr Leu Arg lie 
Vai Asp Lys Gly 



40* 



Vai 


Thr 


Vai 


Thr 








C f\ 

dU 


Glu 


Lys 


Thr 


Vai 






75 




lie 


Lys 


lie 


Pro 




90 






Lys 


Tyr 


Vai 


Thr 


105 








Ala 


Vai 


Met 


Vai 


Lys 


Thr 


lie 


Tyr 








140 


Vai 


Asp 


Asn 


Asn 






155 




Glu 


Thr 


Pro 


Asp 




170 






Asn 


Gin 


His 


Gly 


185 








Asn 


Met 


Gly 


Gin 


Gin 


He 


Phe 


Ser 








220 


Phe 


Glu 


Vai 


Arg 






235 




Pro 


Asn 


Gly 


Leu 




250 






Asn 


Vai 


Asd 


Glv 


265 








Lys 


Lys 


lie 


Ser 


Gly Vai 


Gly 


Asp 








300 


Arg 


Pro 


Ser 


Asn 






315 




Vai 


Thr 


Vai 


He 




330 






Ser 


Gly 


lie 


Pro 


345 








Thr 


Pro 


Lys 


Phe 


Vai 


Thr 


Asn 


Pro 








380 


Gin Gly 


Ser 


Asn 






395 




Leu 


Ser 


lie 


Asn 




410 






Arg 


Thr 


Lys 


Lys 


425 








Met 


Glu 


Ala 


His 


His 


Leu 


Ser 


Vai 








460 


Vai 


Asn 


Phe 


His 






475 




Tyr 


Tyr 


Thr 


Tyr 


490 






Arg 


Gin 


Vai 


Arg 


505 








He 


Thr 


Pro 


Glu 


Leu 


lie 


Gly 


Ala 








540 


Vai 


Asp 


Vai 


Lys 






555 




Pro 


Arg 


Asp 


Asn 




570 






Glu 


Gly 


Asn 


Gin 


585 








Vai 


Phe 


Vai 


Leu 



45 








Vai 


Gin 


Asp 


Phe 


Leu 


Thr 


Gly 


Ala 






80 


Ala 


Ser 


Lvs 


Glu 






95 




vai 


Vai 


Ala 


Asn 




110 






Ser 


Phe 


Gin 


Ser 


125 








Thr 


Pro 


Gly 


Ser 


Leu 


Leu 


Pro 


Val 








160 


Gly lie 


Pro 


val 






175 




lie 


Leu 


Pro 


Leu 




190 






Trp Lys 


lie 


Aror 


205 








Ala 


GlU 


Phe 


Glu 


Vai 


Glu 


Pro 


Thr 








240 


Glu 


Vai 


Ser 


He 






255 




Thr 


Ala 


Phe 


Val 




270 






Leu 


Ala 


His 


Ser 


285 








Ala 


vai 


Leu 


Thr 


Ala Asp 


Ala 


Leu 








320 


Leu 


His 


Ser 


Gly 






335 




He 


Vai 


Thr 


Ser 




350 






Phe 


Lys 


Pro 


Ala 


365 








Asp Gly 


Ser 


Pro 


Ala 


Lys 


Ala 


Leu 








400 


Thr 


Pro 


Asn 


Ser 






415 




Asp 


Thr 


Leu 


Pro 




430 






Pro 


Tyr 


Ser 


Thr 


445 








Ser 


Arg 


Met 


Glu 


Leu Arg 


Thr 


Asd 








480 


Leu 


Val 


Met 


Asn 






495 




Glu 


Pro 


Glv 


Gin 




510 






Phe 


He 


Pro 


Ser 


525 








Ser Gly 


Gin 


Arq 


Asp Ser 


Cys 


lie 








560 


His 


Leu 


Ala 


Pro 






575 




Gly 


Ala 


Arg 


Val 




590 






Asn 


Lys 


Lys 


Asn 
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595 600 605 

Lys Leu Thr Gin Ser Lys lie Trp Asp Val Val Glu Lys Ala Asp lie 

610 615 620 

Gly Cys Thr Pro Gly Ser Glv Lys Asn Tyr Ala Gly Val Phe Met Asp 
625 630 635 640 

Ala Gly Leu Ala Phe Lys Thr Ser Gin Gly Leu Gin Thr Glu Gin Arg 

645 650 655 

Ala Asp Leu Glu Cys Thr Lys Pro Ala Ala Arg Arg Arg Arg Ser Val 

660 665 670 

Gin Leu Met Glu Arg Arg Met Asn Lys Ala Gly Gin Tyr Thr Asp Lys 

675 680 685 

Gly Leu Arg Lys Cys Cys Glu Asp Gly Met Arg Asp lie Pro Met Arg 

690 695 700 

Tyr Ser Cys Gin Arg Arg Ala Arg Leu lie Thr Gin Gly Glu Asn Cys 
705 710 715 720 

lie Lys Ala Phe He Asp Cvs Cys Asn His He Thr Lys Leu Arg Glu 

725 730 735 

Gin His Arg Arg Asp His Val Leu Gly Leu Ala Arg Ser Glu Leu Glu 

740 745 750 

Glu Asp He He Pro Glu Glu Asp He He Ser Arg Ser His Phe Pro 

755 760 765 

Gin Ser Trp Leu Trp Thr lie Glu Glu Leu Lys Glu Pro Glu Lys Asn 

770 775 780 

Gly He Ser Thr Lys Val Met Asn He Phe Leu Lys Asp Ser He Thr 
785 790 795 800 

Thr Trp Glu He Leu Ala Val Ser Leu Ser Asp Lys Lys Gly He Cys 

805 8X0 815 

Val Ala Asp Pro Tyr Glu He Arg Val Met Gin Asp Phe Phe He Asp 

820 825 830 

Leu Arg Leu Pro Tyr Ser Val Val Arg Asn Glu Gin Val Glu He Arg 

835 840 845 

Ala Val Leu Phe Asn Tyr Arg Glu Gin Gin Glu Leu Lys Val Arg Val 

850 855 860 

Glu Leu Leu His Asn Pro Ala Phe Cys Ser Met Ala Thr Ala Lys Asn 
865 870 875 880 

Arg Tyr Phe Gin Thr He Lys lie Pro Pro Lys Ser Ser Val Ala Val 

885 890 895 

Pro Tyr Val He Val Pro Leu Lys He Gly Gin Gin Glu Val Glu Val 

900 905 910 

Lys Ala Ala Val Phe Asn His Phe lie Ser Asp Gly Val Lys Lys Thr 

915 920 925 

Leu Lys Val Val Pro Glu Gly Met Arg lie Asn Lys Thr Val Ala He 

930 935 940 

His Thr Leu Asp Pro Glu Lvs Leu Gly Gin Gly Gly Val Gin Lys Val 
945 950 * 955 960 

Asp Val Pro Ala Ala Asp Leu Ser Asp Gin Val Pro Asp Thr Asp Ser 

965 970 975 

Glu Thr Arg He He Leu Gin Gly Ser Pro Val Val Gin Met Ala Glu 

980 985 990 

Asp Ala Val Asp Gly Glu Arg Leu Lys His Leu He Val Thr Pro Ala 

995 1000 1005 

Gly Cys Gly Glu Gin Asn Met He Gly Met Thr Pro Thr Val lie Ala 

1010 1015 1020 

val His Tyr Leu Asp Gin Thr Glu Gin Trp Glu Lys Phe Gly He Glu 
025 1030 1035 1040 

Lys Arg Gin Glu Ala Leu Glu Leu lie Lys Lys Gly Tyr Thr Gin Gin 

1045 1050 1055 

Leu Ala Phe Lys Gin Pro Ser Ser Ala Tyr Ala Ala Phe Asn Asn Arg 

1060 1065 1070 

Pro Pro Ser Thr Trp Leu Thr Ala Tyr Val Val Lys Val Phe Ser Leu 

1075 1080 1085 

Ala Ala Asn Leu He Ala lie Asp Ser His Val Leu Cys Gly Ala Val 

1090 1095 1100 

Lvs Trp Leu He Leu Glu Lys Gin Lys Pro Asp Gly Val Phe Gin Glu 
loS 1110 HIS H20 

Asd Glv Pro Val He His Gin Glu Met lie Gly Gly Phe Arg Asn Ala 

* 1125 H30 1135 

Lvs Glu Ala Asp Val Ser Leu Thr Ala Phe Val Leu He Ala Leu Gin 

7 H40 H45 1150 

Glu Ala Arg Asp lie Cys Glu Gly Gin Val Asn Ser Leu Pro Gly Ser 
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1155 1160 1165 

He Asn Lys Ala Gly Glu Tyr lie Glu Ala Ser Tyr Met Asn Leu Gin 

1170 1175 1180 

Arg Pro Tyr Thr Val Ala He Ala Gly Tyr Ala Leu Ala Leu Met Asn 
185 1190 1195 1200 

Lvs Leu Glu Glu Pro Tyr Leu Glv Lvs Phe Leu Asn Thr Ala Lys Asp 

1205 " 1210 1215 

Arg Asn Arg Trp Glu Glu Pro Asp Gin Gin Leu Tyr Asn Val Glu Ala 

1220 1225 1230 

Thr Ser Tyr Ala Leu Leu Ala Leu Leu Leu Leu Lys Asp Phe Asp Ser 

1235 1240 1245 

Val Pro Pro Val Val Arg Trp Leu Asn Glu Gin Arg Tyr Tyr Gly Gly 

1250 1255 1260 

Gly Tyr Gly Ser Thr Gin Ala Thr Phe Met Val Phe Gin Ala Leu Ala 
265 1270 1275 1280 

Gin Tyr Gin Thr Asp Val Pro Asp His Lys Asp Leu Asn Met Asp Val 

1285 1290 1295 

Ser Phe His Leu Pro Ser Arg Ser Ser Ala Thr Thr Phe Arg Leu Leu 

1300 1305 1310 

Trp Glu Asn Gly Asn Leu Leu Arg Ser Glu Glu Thr Lys Gin Asn Glu 

1315 1320 1325 

Ala Phe Ser Leu Thr Ala Lvs Gly Lys Gly Arg Gly Thr Leu Ser Val 

1330 1335 1340 

Val Ala Val Tvr His Ala Lys Leu Lvs Ser Lys Val Thr Cys Lys Lys 
345 * 1350 1355 1360 

Phe Asd Leu Arg Val Ser He Arg Pro Ala Pro Glu Thr Ala Lys Lys 

1365 1370 1375 

Pro Glu Glu Ala Lys Asn Thr Met Phe Leu Glu He Cys Thr Lys Tyr 

1380 1385 1390 

Leu Gly Asp Val Asp Ala Thr Met Ser He Leu Asp He Ser Met Met 

1395 1400 1405 

Thr Gly Phe Ala Pro Asp Thr Lys Asp Leu Glu Leu Leu Ala Ser Gly 

1410 1415 1420 

Val Asp Arg Tyr He Ser Lys Tyr Glu Met Asn Lys Ala Phe Ser Asn 
425 1430 1435 1440 

Lys Asn Thr Leu He He Tyr Leu Glu Lys He Ser His Thr Glu Glu 

1445 1450 1455 

Asp Cys Leu Thr Phe Lys Val His Gin Tyr Phe Asn Val Gly Leu He 

1460 1465 1470 

Gin Pro Gly Ser Val Lys Val Tyr Ser Tyr Tyr Asn Leu Glu Glu Ser 

1475 1480 1485 

Cvs Thr Arg Phe Tyr His Pro Glu Lys Asp Asp Gly Met Leu Ser Lys 

1490 1495 1500 

Leu Cys His Ser Glu Met Cys Arg Cvs Ala Glu Glu Asn Cys Phe Met 
505 ' 1510 * 1515 1520 

Gin Gin Ser Gin Glu Lys He Asn Leu Asn Val Arg Leu Asp Lys Ala 

1525 1530 1535 

Cys Glu Pro Gly Val Asp Tyr Val Tyr Lys Thr Glu Leu Thr Asn He 

1540 1545 1550 

Lys Leu Leu Asp Asp Phe Asp Glu Tyr Thr Met Thr He Gin Gin Val 

1555 1560 1565 

He Lys Ser Gly Ser Asp Glu Val Gin Ala Gly Gin Gin Arg Lys Phe 

1570 1575 1580 

He Ser His He Lys Cys Arg Asn Ala Leu Lys Leu Gin Lys Gly Lys 
585 1590 1595 1600 

Lys Tyr Leu Met Trp Gly Leu Ser Ser Asp Leu Trp Gly Glu Lys Pro 

1605 1610 1615 

Asn Thr Ser Tvr He lie Gly Lys Asp Thr Trp Val Glu His Trp Pro 

1620 1625 1630 

Glu Ala Glu Glu Cys Gin Asp Gin Lys Tyr Gin Lys Gin Cys Glu Glu 

1635 1640 1645 

Leu Gly Ala Phe Thr Glu Ser Met Val Val Tyr Gly Cys Pro Asn 
1650 1655 1660 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1744 amino acids 

(B) TYPE: ammo acid 

(C) STRAND EDNESS : single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
<vi) ORIGINAL SOURCE: 





(xi) SEQUENCE 


DESCRIPTION 


Met 


Arg 


Leu 


Leu 


Trp 

c 

b 


uiy 


Leu 


lie 


1 
Ser 


Leu 


Gin 


Lys 


Pro 


Arg 


Leu 


Leu 








20 










Leu 


Gly 


Val 


Pro 


Leu 


Ser 


vai 


uiy 




35 










a n 
4 U 


Gly 


Gin 


Val 


Val 


Lys 


Gly 


Ser 


vai 


50 










c c 




Asn 


Val 


Pro 


Cys 


ser 


Pro 


Lys 


vdl 


65 










/ u 






Asp 


Phe 


Ala 


Leu 


Leu 


Ser 


Leu 


Gin 








85 








Cys 


Gly 


Leu 


His 


Gin 


Leu 


Leu 


Arg 








100 










His 


Ser 


Pro 


Trp 


Leu 


Lys 


Asp 


Ser 






115 










IzU 


Gly 


He 


Asn 


Leu 


Leu 


Phe 


Ser 


Ser 


130 










TIC 

135 




Thr 


Asp 


Gin 


Pro 


He 


Tyr 


Asn 


Pro 


145 








150 






Phe 


Ala 


Leu 


Asp 


Gin 


Lys 


Met 


Arg 










165 








Met 


Val 


Glu 


Asn 


Ser 


His 


Gly 


Leu 








180 










Met 


Pro 


Ser 


Ser 


He 


Phe 


Gin 


Asp 






195 










z uu 


Glu 


Pro 


Gly 


Thr 


Trp 


Lys 


He 


Ser 




210 










215 




Ser 


Asn 


Ser 


Ser 


Thr 


Gin 


Phe 


Glu 


225 










230 






Phe 


Glu 


Val 


Lys 


He 


Thr 


Pro 


Gly 










245 








Gly 


His 


Leu 


Asp 


Glu 


Met 


Gin 


Leu 






260 










Gly 


Lys 


Pro 


Val 


Gin 


Gly 


Val 


Ala 


275 










280 


Glu 


Asp 


Gly 


Lys 


Lys 


Thr 


Phe 


Phe 




290 










295 




Leu 


Val 


Asn 


Gly 


Gin 


Ser 


His 


lie 


305 








310 






Asp 


Ala 


Leu 


Glu 


Lys 


Leu 


Asn 


Met 








325 








Ara 


Leu 


Tyr 


Val 


Ala 


Ala 


Ala 


He 








340 










Glu 


Glu 


Ala 


Glu 


Leu 


Thr 


Ser 


Trp 






355 










360 


Leu 


Asp 


Leu 


Ser 


Lys 


Thr 


Lys 


Arg 




370 










375 




Leu 


Leu 


Gin 


Ala 


Leu 


Val 


Arg 


Glu 


3B5 










390 






He 


Pro 


Val 


Lys 


Val 


Ser 


Ala 


Thr 








405 








Glu 


Ala 


Gin 


Asp 


He 


Gin 


Gin 


Asn 








420 










He 


Pro 


He 


He 


He 


Pro 


Gin 


Thr 






435 










440 


Ser 


Ala 


Gly 


Ser 


Pro 


His 


Pro 


Ala 




450 








455 




Pro 


Pro 


Ser 


Gly 


Gly 


Pro 


Gly 


Phe 


465 










470 







SEQ ID NO: 9: 



Trp 


Ala 


Ser 


Ser 


Phe Phe 


Thr 


Leu 


10 








15 




Leu 


Phe 


Ser 


Pro 


Ser Val 


Val 


His 


25 








30 






Val 


Gin 


Leu 


Gin 


Asp Val 


Pro 


Arg 










45 






Phe 


Leu 


Arg 


Asn 


Pro Ser 


Arg 


Asn 






60 








Asp 


Phe 


Thr 


Leu 


Ser Ser 


Glu 


Arg 






75 








80 


Val 


Pro 


Leu 


Lys Asp Ala 


Lys 


Ser 




90 








95 




Gly 


Pro 


GlU 


Val 


Gin Leu 


Val 


Ala 


105 








110 






Leu 


Ser 


Arg 


Thr 


Thr Asn 


He 


Gin 








125 






Arg 


Arg 


Gly 


His 


Leu Phe 


Leu 


Gin 








140 








Gly 


Gin 


Arg 


Val 


Arg Tyr 


Arg 


Val 






155 








160 


Pro 


Ser 


Thr 


Asp 


Thr lie 


Thr 


Val 




170 






175 




Arg 


Val 


Arg 


Lys 


Lys Glu 


Val 


Tyr 


185 








190 






Asp 


Phe 


Val 


He 


Pro Asp 


He 


Ser 










205 






Ala 


Arg 


Phe 


Ser Asp Gly 


Leu 


Glu 








220 








Val 


Lys 


Lys 


Tyr 


Val Leu 


Pro 


Asn 






235 








240 


Lys 


Pro 


Tyr 


He 


Leu Thr 


Val 


Pro 


250 






255 




Asp 


He 


Gin 


Ala 


Arg Tyr 


He 


Tyr 


265 








270 






Tyr 


Val 


Arg 


Phe 


Gly Leu 


Leu 


Asp 










285 






Arg 


Gly 


Leu 


Glu 


Ser Gin 


Thr 


Lys 








300 








Ser 


Leu 


Ser 


Lys 


Ala' Glu 


Phe 


Gin 






315 








320 


Gly 


He 


Thr 


Asp 


Leu Gin 


Gly 


Leu 




330 








335 




He 


Glu 


Ser 


Pro Gly Gly 


Glu 


Met 


345 








350 






Tyr 


Phe 


Val 


Ser 


Ser Pro 


Phe 


Ser 








365 






His 


Leu 


Val 


Pro 


Gly Ala 


Pro 


Phe 








380 








Met 


Ser 


Gly 


Ser 


Pro Ala 


Ser 


Gly 






395 








400 


Val 


Ser 


Ser 


Pro Gly Ser 


Val 


Pro 




410 








415 




Thr 


Asp 


Gly 


Ser 


Gly Gin 


Val 


Ser 


425 








430 






He 


Ser 


Glu 


Leu 


Gin Leu 


Ser 


Val 










445 






He 


Ala 


Arg 


Leu 


Thr Val 


Ala 


Ala 






460 








Leu 


Ser 


He 


GlU 


Arg Pro 


Asp 


Ser 






475 








480 
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Arg 


Pro 


Pro 


Arg 


vdl 


p 1 w 
uiy 


Asp 


Thr 


Leu 


Asn 


Levi Asn Leu Aro 


Ala 


Val 






485 










490 




495 




\sX j 


Ser 


ui y 


xvl d 


Thr 


Phe 


Ser 


His 


Tvr 


Tvr 

j 


Tyr Met lie Leu 


Ser 


Arq 




500 










505 




510 








bin 


11c 


Val 


flic 


1 IC L. 


Asn 


*vxy 


Glu 


Pro 


Lvs Aro Thr Leu 


Thr 


Ser 




515 










520 






525 






vai 


O A V 

ser 


IT- 1 

vai 


i*ne 


Val 




Hi <; 
nx i> 


His 
nxo 


Leu 


Ala 
i"LX a 


Pro S^r ohp Tvr 

XrXw wCX ilw x y j. 


Phe 


Val 




530 










53 5 








540 






Ala 


file 


Tyr 


Tyr 


Hie 


Gly 


Aso 


His 


Pro 


Val 


Ala Asn Ser Leu 


Ara 


Val 


545 






550 










555 


Val 


560 


ASp 


vai 


pi m 

bin 


/ila 


gi v 

ux y 


Ala 

/~tx a 


Pvs 
v-y^ 


Glu 


Glv 


Lys 


Leu Glu Leu Ser 


Asd 








56 5 

WWW 










570 




575 




P 1 »r 

biy 


TV "I -j 

Ala 


Lys 


oxli 


Tyr 


/iiy 


Asn 


Glv 
ux y 


Glu 


Ser 


Val Lvs Leu His 


Leu 


Glu 




con 

3 O w 










585 




590 






TUX 


ASp 


SCI 


Leu 


ni a 

Hid 


T .an 


Val 


Ala 


Leu 


Gly 


Ala Leu Asn Thr 


Ala 


Leu 














600 






605 






Tyr 


Aia 


Ala 


uiy 


Ser 


Lys 


Cor 
OCX 




_ 

Lys 


c L w 


T.aii A en Mot* nlv 
Xjcix nail l ie u vjxy 


Lys 


Val 


Cin 
O 1U 










615 








620 






Pne 


p 1 * * 


Aia 




Asn 


ser 




Act** 


Leu 


wx y 


pvc Glv Pro Glv 


Glv 


Glv 

vsxy 


62 S 










D J w 










0 j w 




640 


Asp 


Ser 


Aia 


Leu 


r*l n 
uin 


Val 


Php 


gi n 


nXCl 


Al a 


Glv T.P11 Ala Php 
uiy jjcu nxa xriic 


Ser 


Asp 








1 4 c 










650 




655 




Giy 


Asp 


Pi r*i 

Lain 


Trp 


lUl 


Leu 


OCX 








T.pn Cay pvc Drn 

wCX \>Jrd * X. W 


Lys 


Glu 




DO U 










O O 3 




670 






Lys 


Thr 


Thr 


Arg 


Lys 


Lys 


Arg 


Asn 


Veil 


Asn 


irne uin Livs /iia 


Tl A 
lie 


XTOli 




675 










con 
OOU 












Glu 


Lys 


Leu 


Gly 


Gin 


Tyr 


Aia 


C ^1 -v- 

oer 


Pro 


xnr 


Aia i<ys Arg uys 


i_ys 


Gl n 




690 








/- Q C 














Asp 


Gly 


Val 


Thr 


Arg 


Leu 


Pro 


Met 


Met 


Arg 


Ser Cys Glu Gin 


Arg 


/via 


705 








710 










/l3 




/ * u 


Ala 


Arg 


Val 


Gin 


Gin 


Pro 


ASp 


cys 


Arg 


G1U 


pro pne Lieu oer 


cys 


Cys 








725 


















Gin 


Phe 


Ala 


Glu 


Ser 


Leu 


Arg 


Lys 


Lys 


Ser 


Arg Asp Lys Gly 


p 1 — 
ulli 


ai a 

Ala 








740 










"7 A C 
f*k b 




/3U 






Gly 


Leu 


Gin 


Arg 


Ala 


Leu 


Glu 


lie 


Leu 


uin 


G1U blU ASp lieu 


Tl a 
lie 


ASp 




•7CC 

75b 














765 






Glu 


Asp 


Asp 


lie 


pro 


vai 


Arg 


Ser 


flic 


XT XiC 


rtu VjIU nsu ^^tr 


Leu 


Tttj 




770 








/ / 3 








7R0 






Arg 


vai 


G1U 


Tnr 


Val 


Asp 


Arg 


IT l its 


Gin 


Tip 
X xc 


T.ati Thi" Lpti Tt*T5 


Leu 


Pro 












Ton 










795 




800 


Asp 


Ser 


Leu 


mr 


inr 


Trp 


wX U 


Tl a 
lie 


Hi c: 
Ills 


Pi v" 

0 x y 


T.011 Cpr T.pti S^T* 


Lys 


Thr 








one 

oUb 










OlU 




815 




Lys 


Gly 


Leu 


Cys 


val 


Aia 


xnr 


Pro 


Val 


uin 


Leu /vx^j vax Xriic 


Aiy 


Glu 














0 * 3 




830 






Phe 


His 


Leu 


His 


Leu 


Arg 


Leu 


Pro 


Mat* 


Ser 


\f n 1 7A T-rr hrrr Dhp 
Val Ax y niy iriie 


W X LI 


Gin 






835 










Q A ft 






O ft J 






Leu 


Glu 


Leu 


Arg 


Pro 


vai 


Leu 


Tyr 


Asn 


Tyr 


T ah A en T \/c Ren 
LcU /voLy l-i yi> rlsil 


Tjaii 
uc u 


Thr 

XilX 




850 








a q ^ 

Q33 








O V w 






Val 


Ser 


Val 


His 


vai 


Ser 


Pro 


\7a 1 
val 


Gl 11 


uiy 


T an Pvc T.aii Hi a 
ucu Lys xitsu ax a 


Glv 
oxy 


Glv 
oxy 


865 










070 

O f w 










875 




880 


Gly 


Gly 


Leu 


Ala 


Gin 


pi n 

Gin 


val 


Leu 


Va 1 
Val 


n 


Ala Glv Qpr al a 
Ala uiy ocx /\xa 


Arg 


Pro 






QQC 

ODD 










O 7 w 




695 




Val 


Ala 


Phe 


Ser 


Val 


vai 


Pro 


inr 


M a 


Ala 
/ila 


r\la nia vax ocx 


Leu 


Lys 








0 rt A 










7U J 




910 






Val 


Val 


Ala 


Arg 


Gly 


ser 


irne 


blu 


rnc 


Pro 


Val Oiy >VoLJ Mia 


vax 


Ser 






915 






y^\j 






7 ^ v 






Lys 


Val 


Leu 


Gin 


lie 


Glu 


Lys 


Pin 

G1U 


pi 

«iy 


Ala 

Aia 


lie rtl5 /ixy uiu 


Gl u 

UlU 


iiCU 


930 










QIC 










Glu 
uiu 


lie 

xxc 


Val 


Tyr 


Glu 


Leu 


Asn 


Pro 


Leu 


Asp 


U-i e 
ill 5 


Arg 


uiy Ai^ 1 iix ucu 


945 








9bU 














960 


Pro 


Gly 


Asn 


Ser 


Asp 


Pro 


Asn 


Met 


xie 


pro 


Asp vaiy Asp fne 


noil 


Car 
OCX 








965 










pin 




975 




Tyr 


Val 


Arg 


vai 


1 ill 


Ala 


Ser 


Asp 


Pro 


Leu 


Asp Thr Leu Gly 


Ser 


Glu 




980 










985 




990 






Gly 


Ala 


Leu 


Ser 


Pro 


Gly 


Gly 


Val 


Ala 


Ser 


L . Leu Arg Leu 


Pro 


Arg 




995 








1000 






1005 






Gly 

] 


Cys 


Gly 


Glu 


Gin 


Thr 


Met 


He 


Tyr 


Leu 


Ala Pro Thr Leu 


Ala 


Ala 


L010 






1015 








1020 






Ser Arg 


Tyr 


Leu 


Asp 

1 


Lys 


Thr 


Glu 


Gin 


Trp 


Ser Thr Leu Pro 


Pro 


Glu 


025 






L030 








1035 


1040 
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Thr Lys Asd His Ala Val Asd Leu He Gin Lys Gly Tyr Met Ara He 

1045 " 1050 1055 

Gin Gin Phe Arg Lys Ala Asp Gly Ser Tyr Ala Ala Trp Leu Ser Arg 

1060 1065 1070 

Asp Ser Ser Thr Trp Leu Thr Ala Phe Val Leu Lys Val Leu Ser Leu 

1075 1080 1085 

Ala Gin Glu Gin Val Gly Gly Ser Pro Glu Lys Leu Gin Glu Thr Ser 

1090 1095 1100 

Asn Trp Leu Leu Ser Gin Gin Gin Ala Asp Gly Ser Phe Gin Asp Pro 
105 1110 1115 1120 

Cys Pro Val Leu Asp Arg Ser Met Gin Gly Gly Leu Val Gly Asn Asp 

1125 1130 1135 

Glu Thr Val Ala Leu Thr Ala Phe Val Thr He Ala Leu His His Gly 

1140 1145 1150 

Leu Ala Val Phe Gin Asp Glu Gly Ala Glu Pro Leu Lys Gin Arg Val 

1155 1160 1165 

Glu Ala Ser He Ser Lys Ala Asn Ser Phe Leu Gly Glu Lys Ala Ser 

1170 1175 1180 

Ala Gly Leu Leu Gly Ala His Ala Ala Ala He Thr Ala Tyr Ala Leu 
185 1190 1195 1200 

Ser Leu Thr Lys Ala Pro Val Asp Leu Leu Gly Val Ala His Asn Asn 

1205 1210 1215 

Leu Met Ala Met Ala Gin Glu Thr Gly Asp Asn Leu Tyr Trp Gly Ser 

1220 1225 1230 

Val Thr Gly Ser Gin Ser Asn Ala Val Ser Pro Thr Pro Ala Pro Arg 

1235 1240 1245 

Asn Pro Ser Asp Pro Met Pro Gin Ala Pro Ala Leu Trp He Glu Thr 

1250 1255 1260 

Thr Ala Tyr Ala Leu Leu His Leu Leu Leu His Glu Gly Lys Ala Glu 
265 1270 1275 1280 

Met Ala Asp Gin Ala Ser Ala Trp Leu Thr Arg Gin Gly Ser Phe Gin 

1285 1290 1295 

Gly Gly Phe Arg Ser Thr Gin Asp Thr Val He Ala Leu Asp Ala Leu 

1300 1305 1310 

Ser Ala Tyr Trp He Ala Ser His Thr Thr Glu Glu Arg Gly Leu Asn 

1315 1320 1325 

Val Thr Leu Ser Ser Thr Gly Arg Asn Gly Phe Lys Ser His Ala Leu 

1330 1335 1340 

Gin Leu Asn Asn Arg Gin He Arg Gly Leu Glu Glu Glu Leu Gin Phe 
345 1350 1355 1360 

Ser Leu Gly Ser Lys He Asn Val Lys Val Gly Gly Asn Ser Lvs Gly 

1365 1370 1375 

Thr Leu Lys Val Leu Arg Thr Tyr Asn Val Leu Asp Met Lys Asn Thr 

1380 1385 1390 

Thr Cys Gin Asp Leu Gin He Glu Val Thr Val Lys Gly His Val Glu 

1395 1400 1405 

Tyr Thr Met Glu Ala Asn Glu Asp Tyr Glu "Asp Tyr Glu Tyr Asp Glu 

1410 1415 1420 

Leu Pro Ala Lys Asp Asp Pro Asp Ala Pro Leu Gin Pro Val Thr Pro 
425 1430 1435 1440 

Leu Gin Leu Phe Glu Glv Arg Arg Asn Arg Arg Arg Arg Glu Ala Pro 

1445 1450 1455 

Lys Val Val Glu Glu Gin Glu Ser Arg Val His Tyr Thr Val Cys He 

1460 1465 1470 

Trp Arg Asn Gly Lys Val Gly Leu Ser Gly Met Ala He Ala Asp Val 

1475 1480 1485 

Thr Leu Leu Ser Gly Phe His Ala Leu Arg Ala Asp Leu Glu Lys Leu 

1490 1495 1500 

Thr Ser Leu Ser Asp Arg Tyr Val Ser His Phe Glu Thr Glu Gly Pro 
505 1510 1515 1520 

His Val Leu Leu Tyr Phe Asp Ser Val Pro Thr Ser Arg Glu Cys Val 

1525 1530 1535 

Gly Phe Glu Ala Val Gin Glu Val Pro Val Gly Leu Val Gin Pro Ala 

1540 1545 1550 

Ser Ala Thr Leu Tyr Asp Tyr Tyr Asn Pro Glu Arg Arg Cys Ser Val 

1555 1560 1565 

Phe Tvr Gly Ala Pro Ser Lys Ser Arg Leu Leu Ala Thr Leu Cys Ser 

1570 1575 1580 

Ala Glu Val Cys Gin Cys Ala Glu Gly Lys Cys Pro Arg Gin Arg Arg 
585 1590 1595 1600 
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Ala Leu Glu Arg Gly Leu Gin Asp Glu Aso Gly Tyr Arg Met Lys Phe 

1605 1610 1615 

Ala Cys Tyr Tvr Pro Arg Val Glu Tyr Gly Phe Gin Val Lys Val Leu 

1620 1625 1630 

Arg Glu Asp Ser Arg Ala Ala Phe Arg Leu Phe Glu Thr Lys lie Thr 

1635 1640 1645 

Gin Val Leu His Phe Thr Lys Asp Val Lys Ala Ala Ala Asn Gin Met 

1650 1655 1660 

Arg Asn Phe Leu Val Arg Ala Ser Cys Arg Leu Arg Leu Glu Pro Gly 
665 1670 ~*1675 1680 

Lvs Glu Tyr Leu lie Met Gly Leu Asp Gly Ala Thr Tyr Asp Leu Glu 

1685 1690 1695 

Gly His Pro Gin Tyr Leu Leu Asp Ser Asn Ser Trp lie Glu Glu Met 

1700 # 1705 1710 

Pro Ser Glu Arg Leu Cys Arg Ser Thr Arg Gin Arg Ala Ala Cys Ala 

1715 " 1720 1725 

Gin Leu Asn Asp Phe Leu Gin Glu Tyr Gly Thr Gin Gly Cys Gin Val 
1730 1735 * 1740 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1738 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 10 : 








Met 


Arg 


Leu 


Leu 


Trp 


Gly 


Leu 


Ala 


Trp 


Val 


Phe 


Ser 


Phe 


Cys 


Ala Ser 


1 








5 










10 










15 


Ser 


Leu 


Gin 


Lys 


Pro 


Arg 


Leu 


Leu 


Leu 


Phe 


Ser 


Pro 


Ser 


Val 


Val Asn 








20 










25 










30 




Leu 


Gly 


Thr 


Pro 


Leu 


Ser 


Val 


Gly 


Val 


Gin 


Leu 


Leu 


Asp 


Ala 


Pro Pro 




35 










40 










45 






Gly 


Gin 


Glu 


Val 


Lys 


Gly 


Ser 


Val 


Phe 


Leu 


Arg 


Asn 


Pro 


Lys 


Gly Gly 


50 










55 










60 








Ser 


Cys 


Ser 


Pro 


Lys 


Lys 


Asp 


Phe 


Lys 


Leu 


Ser 


Ser 


Gly 


Asp 


Asp Phe 


65 








70 










75 








80 


Val 


Leu 


Leu 


Ser 


Leu 


Glu 


Val 


Pro 


Leu 


Glu 


Asp 


Val 


Arg 


Ser 


Cys Gly 










85 










90 










95 


Leu 


Phe 


Asp 


Leu 


Arg 


Arg 


Ala 


Pro 


His 


He 


Gin 


Leu 


Val 


Ala 


Gin Ser 






100 










105 










110 




Pro 


Trp 


Leu 


Arg 


Asn 


Thr 


Ala 


Phe 


Lys 


Ala 


Thr 


Glu 


Thr 


Gin 


Gly Val 




115 








120 










125 






Asn 


Leu 


Leu 


Phe 


Ser 


Ser 


Arg 


Arg 


Gly 


His 


lie 


Phe 


Val 


Gin 


Thr Asp 




130 










135 










140 








Gin 


Pro 


He 


Tyr 


Asn 


Pro 


Gly 


Gin 


Arg 


Val 


Arg 


Tyr 


Arg 


Val 


Phe Ala 


145 








150 










155 








160 


Leu 


Asp 


Gin 


Lys 


Met 


Arg 


Pro 


Ser 


Thr 


Asp 


Phe 


Leu 


Thr 


He 


Thr Val 






165 










170 










175 


Glu 


Asn 


Ser 


Kis 


Gly 


Leu 


Arg 


Val 


Leu 


Lys 


Lys 


Glu 


He 


Phe 


Thr Ser 








180 








185 










190 




Thr 


Ser 


He 


Phe 


Gin 


Asp 


Ala 


Phe 


Thr 


He 


Pro 


Asp 


He 


Ser 


Glu Pro 






195 










200 










205 






Gly 


Thr 


Trp 


Lys 


lie 


Ser 


Ala 


Arg 


Phe 


Ser 


Asp 


Gly 


Leu 


Glu 


Ser Asn 


210 








215 










220 








Arg 


Ser 


Thr 


His 


Phe 


Glu 


Val 


Lys 


Lys 


Tyr 


Val 


Leu 


Pro 


Asn 


Phe Glu 


225 










230 










235 








240 


Val 


Lys 


He 


Thr 


Pro 


Trp 


Lys 


Pro 


Tyr 


He 


Leu 


Met 


Val 


Pro 


Ser Asn 








245 










250 










255 


Ser 


Asp 


Glu 


He 


Gin 


Leu 


Asp 


He 


Gin 


Ala 


Arg 


Tyr 


He 


Tyr 


Gly Lys 






260 










265 










270 




Pro 


Val 


Gin 


Gly 


Val 


Ala 


Tyr 


Thr 


Arg 


Phe 


Ala 


Leu 


Met 


Asp 


Glu Gin 
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Gly 


Lys 


Arg 


Thr 




290 






Glu 


Gly 


Arg 


Thr 


305 








Leu 


Asp 


Lys 


He 


Tvx 
j 


Ala 


Ala 


Thr 








340 


Ala 


Glu 


Leu 


Thr 






355 




Leu 


Ser 


Arg 


Thr 




370 






Gin 


Ala 


Leu 


Val 


385 








Val 


Lys 


Val 


Ser 


Asp 


He 


Gin 


Gin 








420 


Pro 


He 


Pro 


Pro 






435 




Ser 


Leu 


Tyr 


Pro 




450 






Arg 


Gly 


Thr 


Gly 


465 








Ser 


Val 


Gly 


Asp 


Ala 


Pro 


Thr 


Phe 








500 


He 


Met 


Ala 


Met 






515 




Val 


Leu 


Val 


Asp 




530 






Phe 


Tyr 


His 


Gin 


545 








Gin 


Ser 


Arg 


Asp 






ryr 










580 


Ser 


Lys 


Ala 


Leu 






595 




Val 


Gly 


Gly 


Arg 




610 






Val 


He 


Asn 


Ser 


625 








Ala 


Leu 


Gin 


Val 




Leu 


Thr 


Gin 








660 


Ser 


Arg 


Gin 


Lys 






675 




Leu 


Gly 


Gin 


Tyr 




690 






Met 


Thr 


Lys 


Leu 


705 








Val 


Pro 


Gin 


Gin 


Ai.a 


VjXU 


Asp 


Leu 








74 0 


Arg 


Asn 


Asn 


His 






755 




Asp 


lie 


Leu 


val 




770 






Glu 


Pro 


Val 


Asp 


785 








Met 


Thr 


Thr 


Trp 


Leu 


Cys 


Val 


Ala 








820 


Leu 


His 


Leu 


Arg 









280 


Phe 


Leu 


Arg 


Gly 






295 




His 


He 


Ser 


lie 




310 






Asn 


He 


Gly 


Val 


325 








Ala 


Val 


He 


Glu 


Ser 


Trp 


Arg 


Phe 








360 


Lys 


Arg 


His 


Leu 






375 




Gin 


Glu 


Met 


Ser 




390 






Ala 


Thr 


Leu 


Val 


405 








Ser 


Thr 


Asn 


Gly 


Thr 


Val 


Thr 


Glu 








440 


Ala 


He 


Ala 


Arg 






455 




Phe 


Leu 


Ser 


He 




470 






Thr 


Phe 


He 


Leu 


485 








Ser 


His 


Tyr 


Tyr 


Gly 


Arg 


Glu 


Pro 








520 


His 


Gin 


Leu 


Ala 






535 




Gly 


His 


Pro 


Val 




550 






Cys 


Glu 


Gly 


Lys 


565 








Asn 


Ala 


Asp 


Met 


Val 


Ala 


Leu 


Gly 








600 


Ser 


His 


Lvs 


Pro 






615 




Tyr 


Asn 


Val 


Gly 




630 






Phe 


Gin 


Asp 


Ala 


645 








Thr 


Arg 


Glu 


Asp 


Arg 


Asn 


Val 


Asn 






680 


Ser 


Ser 


Pro 


Asp 






695 




Pro 


Met 


Lys 


Arg 




710 






Ala 


Cys 


Arg 


Glu 


725 








Arg 


Arg 


Asn 


Gin 


Asn 


Met 


Leu 


Gin 








760 


Arg 


Thr 


Ser 


Phe 




775 




Ser 


Ser 


Lys 


Leu 




790 






Glu 


He 


His 


Gly 


805 








Lys 


Pro 


Thr 


Arg 


Leu 


Pro 


He 


Ser 



47. 



Leu Glu Thr Gin 
300 

Ser Lys Asd Gin 
315 

Arg Asp Leu Glu 
330 

Ser Pro Gly Gly 
345 

Val Ser Ser Ala 

Val Pro Gly Ala 
380 

Gly Ser Glu Ala 
395 

Ser Gly Ser Asp 
410 

He Gly Gin Val 
425 

Leu Arg Leu Leu 

Leu Thr Val Gin 
460 

Glu Pro Leu Asp 
475 

Asn Leu Gin Pro 
490 

Tyr Met He He 
505 

Arg Lys Thr Val 

Pro Ser Phe Tyr 
540 

Ala Asn Ser Leu 
555 

Leu Gin Leu Lys 
570 

Met Lys Leu Arg 
585 

Ala Val Asp Thr 

Leu Asp Met Ser 
620 

Cys Gly Pro Gly 
635 

Gly Leu Ala Phe 
650 

Leu Ser Cys Pro 
665 

Phe Gin Lys Ala 

Ala Lys Arg Cys 
700 

Thr Cys Glu Gin 
715 

Pro Phe Leu Ser 
730 

Thr Arg Ser Gin 
745 

Glu Glu Asp Leu 

Pro Glu Asn Trp 
7B0 

Leu Thr Val Trp 
795 

Val Ser Leu Ser 
810 

Val Arg Val Phe 
825 

He Arg Arg Phe 



285 



Ala 


Lys 


Leu 


Val 


Phe 


Gin 


Ala 


Ala 








320 


Gly 


Leu 


Arg 


Leu 






335 




Glu 


Met 


Glu 


Glu 




350 






Phe 


Ser 


Leu 


Asp 


365 








His 


pne 


Leu 


Leu 


Ser 


Asn 


Val 


Pro 








400 


Ser 


Gin 


val 


Leu 






415 




Ser 


He 


Ser 


Phe 




430 






Val 


Ser 


Ala 


Gly 


445 








Ala 


Pro 


Pro 


Ser 


Pro 


Arg 


Ser 


Pro 








480 


Val 


Gly 


He 


Pro 






495 




Ser 


Arg 


Gly 


Gin 




510 






Thr 


Ser 


Val 


Ser 


525 








Phe 


Val 


Ala 


Tyr 


Leu 


He 


Asn 


He 








560 


Val 


Asp 


Gly 


Ala 






575 




He 


Gin 


Thr 


Asp 




590 






Ala 


Leu 


Tyr 


Ala 


605 








Lys 


Val 


Phe 


Glu 


Gly 


Gly 


Asp 


Asp 








640 


Ser 


Asp 


Gly 


Asp 






655 




Lys 


Glu 


Lys 


Lys 




670 






Val 


Ser 


Glu 


Lys 


665 








Cys 


Gin 


Asp 


Gly 


Arcr 


Ala 


Ala 


Arcr 








720 


Cys 


Cys 


Lys 


Phe 






735 




Ala 


His 


Leu 


Ala 




750 






He 


Asp 


Glu 


Asp 


765 








Leu 


Trp 


Arg 


Val 


Leu 


Pro 


Asp 


Ser 








800 


Lys 


Ser 


Lys 


Gly 






815 




Arg 


Lys 


Phe 


His 




830 






Glu 


Gin 


Phe 


Glu 
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835 840 845 

Leu Arg Pro Val Leu Tyr Asn Tyr Leu Asn Asp Asp Val Ala Val Ser 

850 855 860 

Val His Val Thr Pro Val Glu Gly Leu Cys Leu Ala Gly Gly Gly Met 
865 870 875 880 

Met Ala Gin Gin Val Thr Val Pro Ala Gly Ser Ala Arg Pro Val Ala 

885 890 895 

Phe Ser Val Val Pro Thr Ala Ala Ala Asn Val Pro Leu Lys Val Val 

900 905 910 

Ala Arg Gly Val Phe Asp Leu Gly Asp Ala Val Ser Lys lie Leu Gin 

915 920 925 

lie Glu Lys Glu Gly Ala lie His Arg Glu Glu Leu Val Tyr Asn Leu 

930 935 940 

Asp Pro Leu Asn Asn Leu Gly Arg Thr Leu Glu lie Pro Gly Ser Ser 
945 950 955 960 

Asp Pro Asn He Val Pro Asp Gly Asp Phe Ser Ser Leu Val Arg Val 

965 970 975 

Thr Ala Ser Glu Pro Leu Glu Thr Met Gly Ser Glu Gly Ala Leu Ser 

980 985 990 

Pro Gly Gly Val Ala Ser Leu Leu Arg Leu Pro Gin Gly Cys Ala Glu 

995 1000 1005 

Gin Thr Met He Tyr Leu Ala Pro Thr Leu Thr Ala Ser Asn Tyr Leu 

1010 1015 1020 

Asp Arg Thr Glu Gin Trp Ser Lys Leu Ser Pro Glu Thr Lys Asp His 
025 1030 1035 1040 

Ala Val Asp Leu He Gin Lys Gly Tyr Met Arg He Gin Gin Phe Arg 

1045 1050 1055 

Lys Asn Asp Gly Ser Phe Gly Ala Trp Leu His Arg Asp Ser Ser Thr 

1060 1065 1070 

Trp Leu Thr Ala Phe Val Leu Lys He Leu Ser Leu Ala Gin Glu Gin 

1075 1080 1085 

Val Gly Asn Ser Pro Glu Lys Leu Gin Glu Thr Ala Ser Trp Leu Leu 

1090 1095 1100 

Ala Gin Gin Leu Gly Asp Gly Ser Phe His Asp Pro Cys Pro Val He 
105 1110 1115 1120 

His Arg Ala Met Gin Gly Gly Leu Val Gly Ser Asp Glu Thr Val Ala 

1125 1130 1135 

Leu Thr Ala Phe Val Val He Ala Leu His His Gly Leu Asp Val Phe 

1140 1145 1150 

Gin Asp Asp Asp Ala Lys Gin Leu Lys Asn Arg Val Glu Ala Ser He 

1155 1160 1165 

Thr Lys Ala Asn Ser Phe Leu Gly Gin Lys Ala Ser Ala Gly Leu Leu 

1170 1175 1180 

Gly Ala His Ala Ala Ala He Thr Ala Tyr Ala Leu Thr Leu Thr Lys 
185 1190 1195 1200 

Ala Ser Glu Asp Leu Arg Asn Val Ala His Asn Ser Leu Met Ala Met 

1205 1210 1215 

Ala Glu Glu Thr Gly Glu His Leu Tyr Trp Gly Leu Val Leu Gly Ser 

1220 1225 1230 

Gin Asp Lys Val Val Leu Arg Pro Thr Ala Pro Arg Ser Pro Thr Glu 

1235 1240 1245 

Pro Val Pro Gin Ala Pro Ala Leu Trp He Glu Thr Thr Ala Tyr Ala 

1250 1255 1260 

Leu Leu His Leu Leu Leu Arg Glu Gly Lys Gly Lys Met Ala Asp Lys 
265 1270 1275 1280 

Ala Ala Ser Trp Leu Thr His Gin Gly Ser Phe His Gly Ala Phe Arg 

1285 1290 1295 

Ser Thr Gin Asp Thr Val Val Thr Leu Asp Ala Leu Ser Ala Tyr Trp 

1300 1305 1310 

He Ala Ser His Thr Thr Glu Glu Lvs Ala Leu Lys Val Thr Leu Ser 

1315 1320 1325 

Ser Met Gly Arg Asn Gly Leu Lys Thr His Gly Leu His Leu Asn Asn 

1330 1335 1340 

His Gin Val Lys Gly Leu Glu Glu Glu Leu Lys Phe Ser Leu Gly Ser 
345 1350 1355 1360 

Thr He Ser Val Lys Val Glu Gly Asn Ser Lvs Gly Thr Leu Lys He 

1365 1370 1375 

Leu Arg Thr Tyr Asn Val Leu Asp Met Lys Asn Thr Thr Cys Gin Asp 

1380 1385 1390 

Leu Gin He Glu Val Lys Val Thr Gly Ala Val Glu Tyr Ala Trp Asp 
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1395 1400 1405 

Ala Asn Glu Asp Tyr Glu Asp Tyr Tyr Asp Met Pro Ala Ala Asp Asp 

1410 1415 1420 

Pro Ser Val Pro Leu Gin Pro Val Thr Pro Leu Gin Leu Phe Glu Gly 
425 1430 1435 1440 

Arg Arg Ser Arg Arg Arg Arg Glu Ala Pro Lys Val Ala Glu Glu Gin 

1445 1450 1455 

Glu Ser Arg Val Gin Tyr Thr Val Cys lie Trp Arg Asn Gly Lys Leu 

1460 1465 1470 

Gly Leu Ser Gly Met Ala lie Ala Asp lie Thr Leu Leu Ser Glv Phe 

1475 1480 1485 

His Ala Leu Arg Ala Asp Leu Glu Lys Leu Thr Ser Leu Ser Asp Arg 

1490 1495 1500 

Tyr Val Ser His Phe Glu Thr Asp Gly Pro His Val Leu Leu Tyr Phe 
505 1510 1515 " 1520 

Asp Ser Val Pro Thr Thr Arg Glu Cys Val Gly Phe Gly Ala Ser Gin 

1525 1530 1535 

Glu Val Val Val Gly Leu Val Gin Pro Ser Ser Ala Val Leu Tyr Asp 

1540 1545 1550 

Tyr Tyr Ser Pro Asp His Lys Cys Ser Val Phe Tyr Ala Ala Pro Thr 

1555 1560 1565 

Lys Ser Gin Leu Leu Ala Thr Leu Cys Ser Gly Asp Val Cys Gin Cys 

1570 1575 1580 

Ala Glu Gly Lys Cys Pro Arg Leu Leu Arg Ser Leu Glu Arg Arg Val 
585 1590 1595 1600 

Glu Asp Lys Asp Gly Tyr Arg Met Arg Phe Ala Cys Tyr Tyr Pro Arg 

1605 1610 1615 

Val Glu Tyr Gly Phe Thr Val Lys Val Leu Arg Glu Asp Gly Arg Ala 

1620 1625 1630 

Ala Phe Arg Leu Phe Glu Ser Lys lie Thr Gin Val Leu His Phe Arg 

1635 1640 1645 

Lys Asp Thr Met Ala Ser lie Gly Gin Thr Arg Asn Phe Leu Ser Arg 

1650 1655 1660 

Ala Ser Cys Arg Leu Arg Leu Glu Pro Asn Lys Glu Tyr Leu lie Met 
665 1670 1675 1680 

Gly Met Asp Gly Glu Thr Ser Asp Asn Lys Gly Asp Pro Gin Tyr Leu 

1685 1690 1695 

Leu Asp Ser Asn Thr Trp lie Glu Glu Met Pro Ser Glu Gin Met Cys 

1700 1705 1710 

Lys Ser Thr Arg His Arg Ala Ala Cys Phe Gin Leu Lys Asp Phe Leu 

1715 1720 1725 

Met Glu Phe Ser Ser Arg Gly Cys Gin Val 
1730 1735 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 
<vi) ORIGINAL SOURCE: 





(xi) SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 11: 




Met 


Gly Leu Leu 


Gly 


He 


Leu 


Cys 


Phe 


Leu 


He Phe Leu Gly Lys 


Thr 


1 


5 










10 


15 




Trp 


Gly Gin Glu 


Gin 


Thr 


Tyr 


Val 


He 


Ser 


Ala Pro Lys He Phe 


Arg 


20 










25 




30 




Val 


Gly Ala Ser 


Glu 


Asn 


He 


Val 


lie 


Gin 


Val Tyr Gly Tyr Thr Glu 




35 








40 






45 




Ala 


Phe Asp Ala 


Thr 


He 


Ser 


He 


Lys 


Ser 


Tyr Pro Asp Lys Lys 


Phe 




50 






55 








60 




Ser 


Tyr Ser Ser 


Gly 


His 


Val 


His 


Leu 


Ser 


Ser Glu Asn Lys Phe 


Gin 


65 




70 










75 


80 
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Asn 


OCX 


Hi a 


Tie 

x xc 


Leu 


Thr 


lie 


Gin 










85 










Drr\ 


Val 


Ser 


Tvr 


Val 


Tyr 


Leu 








100 










Lys 


ser 


Lys 


7A T*f*T 


Met 


Pro 


lie 


Thr 














120 


nlS 




Asp 




FX V 


Val 


Tvr* 
* i *• 


Thr 














135 




vax 


Tyr 


Ser 


Leu 


noli 


Asp 


Asp 


Leu 










150 






Leu 


Thr- 
Xlli 


JrHC 


Tip 
IXC 




Pro 


Glu 


Glv 

ux y 










165 

X O J 










AS ID 


XIX S3 


Tie 

X xc 


Glv 
vax y 


lie 


lie 


Ser 






180 












Pro 


Arg 


iyr 


gi v 


Met 




Thr 






1 Q5 










200 


trie 


Ser 


mr 


Thr* 
1 1 IX 


gi v 


Thr 

1 11 A. 


Ala 


Tvr 




210 










215 




pro 


TJ J — 

illS 


Fne 


Ca v 

ser 


val 


Ser 


lie 


Glu 


n ^ c 










4b J U 






LiyS 


Asn 


rile 


T \/C 
Xiys 


A en 




Glu 


lie 
















Asn 


L«ys 


vax 


vax 


J. ill. 


Glu 


Ala 


Asp 






260 










Glu 


Asp 


Leu 


Lys 


Asp 


Asp 




i<ys 




275 










^ O VJ 


Asn 


Thr 


Met 


Leu 


Tl a 

lie 


Asn 


uiy 


Tip 
lie 




290 










OOP 




Thr 


Ala 


val 


Lys 


Glu 


Leu 


Ser 


Tyr 


305 














Lys 


Tyr 


Leu 


Tyr 


Tl A 

lie 


Axa 


Val 


Thr- 
lilx 








32:> 








Ser 


Glu 


Glu 


ax a 


ulu 


lie 


Pro 


CI v 
oi y 


















Lys 


Leu 


Asn 


Leu 


vai 


Ala 

ax a 


inr 


_ 

pro 




355 










J w w 


Tyr 


Pro 


lie 


Lys 


vax 




vai 




370 










375 




Gxy 


vax 


Pro 


vax 


lie 


T .oil 


Asn 


Ala 


385 










3 90 






Thr 


Ser 


Asp 


Leu 


Asp 


JrX U 


Ser 


Lys 










405 








vax 


Ala 


Ser 


ir lie 


Vai 


Leu 


Asn 


Leu 








*» ^ U 










Pne 


Asn 


vai 


Lys 


rrk 
1 ILL 


Asp 


mx a 


Pro 






435 








440 


Arg 


Glu 


pi mm 

oiy 


Tyr 


Arg 


Ala 
Axa 


lie 


Ala 












455 




Leu 


Tyr 


lie 


Asp 


Trp 


1 11X 




Asn 


465 








AlCi 
*k 1 U 






His 


Leu 


Asn 


Tip 

xxe 


Tl f> 
XXc 


Val 


Thr 


Pro 










no 3 








Thr 


His 


Tyr 


Asn 


Tyr 


lieu 


Tie 

XXC 


Leu 






c nn 
5 UU 










Gly 


Thr 


Arg 


p1 >• 
wlU 




Dho 

Jrilc 


Cot* 

OCX 


A cn 




515 










520 


Pro 


val 


Thr 


uxn 


Asn 


ric u 


Val 


Pro 




530 










3 J 3 




lie 


Val 


Thr 


Gly 


CjXU 




1 111 


nXa 


545 
















Leu 


Asn 


lie 


Glu 


ulu 


Lys 


^ys 


wiy 










565 








Pro 


Asp 


Ala 


Asp 


Ala 


Tyr 


Ser 


Pro 






580 








Val 


Ala 


Thr 


Gly 


Met 


Asp 


Ser 


Trp 






595 










600 


Val 


Tyr 


Gly 


Val 


Gin 


Arg 


Gly 


Ala 




610 








615 




Gin 


Phe 


Leu 


Glu 


Lys 


Ser 


Asp 


Leu 



625 630 



•50- 



Pro 


Lvs 


Gin 


Leu Pro Gly Gly 


Gin 




90 








95 




Glu 


Val 


Val 


Ser 


Lys His 


Phe 


Ser 


105 








110 






Tvr 

i y i. 


Asd 


Asn 


Gly Phe Leu 


Phe 


He 










125 






Pro 


Aso 


Gin 


Ser 


Val Lys 


Val 


Arc 








1?U 








T \rc 
Xiy o 


Dm 


i"ixa 


Lys 


Arg Glu 


Thr 


Val 






155 








160 


Ser 


Glu 


Val 


Asp 


Met Val 


Glu 


Glu 




170 






175 




Phe 


Pro 


Asp 


Phe 


Lys lie 


Pro 


Ser 


185 








190 






lie 


Lys 


Ala 


Lys 


Tyr Lys 


Glu 


Asn 










205 






Phe 


Glu 


Val 
vax 


Lys 


Glu Tyr 


Val 


Leu 








220 








ri c 


Gin 


Tvr 


Asn 


Phe He Gly 


Tvr 

iy x 






235 








240 


Thr 


Tl e 

XXC 


Lvs 


Ala Arg Tyr 


Phe 


Tvr 




250 








255 




Val 


Tyr 


Tip 
XXC 


Thr 


Phe Gly 


He 




5 








270 






Gl n 


Mar 


Mat 

1*1 c u 


Gin 


Thr Ala 


Met 


Gin 










285 






n 1 a 
Ala 


r*l n 
win 


vax 


Thr 


Phe Asp 


Ser 


Glu 








300 








Tyr 


Ser 


Leu 


Glu Asp Leu 


Asn 


Hail 




31 5 








320 


vai 


Tl - 

lie 


blU 


Ser Thr Gly Gly 


lr I1C 












335 




Tl a 

ne 


Lys 


Tyr 


Val 


Leu Ser 


Pro 


xyx- 










350 






Leu 


pne 


Leu 


Lys 


Pro Gly He 












365 






Asp 


Car 


Leu 


Asp 


Gin Leu 


Val 


Glv 
uxy 








380 








win 


Thr 

XilX 


Tl a 

XXC 


Asp Val Asn Gin 


Glu 






7 9 5 

J 7 3 








400 


Ser 


Val 


Th t* 

i hi 


Arg 


Val Asp Asp 


Glv 
wxy 




410 








415 




Pro 


Cor* 


Gl V 
uiy 


Val 


Thr Val 


Leu 


Glu 


425 






430 






Asp 


L6U 


Pro 


Glu 


Glu Asn 


Gin 


Ala 








445 






iyr 


Ser 


Ser 


Leu 


Ser Gin 


Ser 


Tvr 






460 








AlS 


_ 

Lys 


Alo 


Leu 


Leu Val 


Gly 


Glu 




475 








480 


Lys 


Cor 


riO 


Tyr 


He Asp 


Lys 


He 












495 




Ser 


Lys 


biy 


Lys 


He He 


His 


Phe 


5U5 








510 






nla 


Car 

OCX 


iyr 


Gin 


Ser He 


Asn 


lie 








525 






JCi 


Ser 


Ar^ 


Leu 


Leu Val 


Tyr 


Tvr 








540 










Leu 


vax 


Ser Asp Ser 


Val 


Ttt^ 

irp 






5 55 
^ 3 w 








560 


Asn 


bin 


Leu 


Gin 


Val His 


Leu 


Ser 




570 








575 




Gi \f 


GT n 

UiU 


Thr 


Val 


Ser Leu 


Asn 


Met 


585 








590 






Ala 


Leu 


Ala 


Ala 


Val Asp 


Ser 


Ala 










605 






Lys 


Lys 


Pro 


Leu Glu Arg 


Val 


Phe 






620 








Gly 


Cys 


Gly 


Ala Gly Gly Gly 


Leu 




635 








640 
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Asn Asn Ala Asn Val Phe His Leu Ala Gly Leu Thr Phe Leu Thr Asn 

645 650 655 

Ala Asn Ala Asp Asp Ser Gin Glu Asn Asp Glu Pro Cys Lys Glu lie 

660 665 670 

Leu Arg Pro Arg Arg Thr Leu Gin Lys Lys lie Glu Glu lie Ala Ala 

675 680 " 685 

Lys Tyr Lys His Ser Val Val Lys Lys Cys Cys Tyr Asp Gly Ala Cys 

690 695 700 

Val Asn Asn Asp Glu Thr Cys Glu Gin Arg Ala Ala Arg lie Ser Leu 
705 710 715 720 

Gly Pro Arg Cys lie Lys Ala Phe Thr Glu Cys Cys Val Val Ala Ser 

725 730 735 

Gin Leu Arg Ala Asn He Ser His Lys Asp Met Gin Leu Gly Arg Leu 

740 745 750 

His Met Lys Thr Leu Leu Pro Val Ser Lys Pro Glu He Arg Ser Tyr 

755 760 765 

Phe Pro Glu Ser Trp Leu Trp Glu Val His Leu Val Pro Arg Arg Lys 

770 775 780 

Gin Leu Gin Phe Ala Leu Pro Asp Ser Leu Thr Thr Trp Glu He Gin 
785 790 795 800 

Gly He Gly He Ser Asn Thr Gly He Cys Val Ala Asp Thr Val Lys 

805 810 815 

Ala Lys Val Phe Lys Asp Val Phe Leu Glu Met Asn He Pro Tyr Ser 

820 825 830 

Val Val Arg Gly Glu Gin He Gin Leu Lys Gly Thr Val Tyr Asn Tyr 

835 840 845 

Arg Thr Ser Gly Met Gin Phe Cys Val Lys Met Ser Ala Val Glu Gly 

850 855 860 

lie Cys Thr Ser Glu Ser Pro Val He Asp His Gin Gly Thr Lys Ser 
865 870 875 880 

Ser Lys Cys Val Arg Gin Lys Val Glu Gly Ser Ser Ser His Leu Val 

865 890 695 

Thr Phe Thr Val Leu Pro Leu Glu He Gly Leu His Asn He Asn Phe 

900 905 9X0 

Ser Leu Glu Thr Trp Phe Gly Lys Glu He Leu Val Lys Thr Leu Arg 

915 920 925 

Val Val Pro Glu Gly Val Lys Arg Glu Ser Tyr Ser Gly Val Thr Leu 

930 935 940 

Asp Pro Arg Gly He Tyr Gly Thr He Ser Arg Arg Lys Glu Phe Pro 
945 950 955 960 

Tyr Arg He Pro Leu Asp Leu Val Pro Lys Thr Glu He Lys Arg He 

965 970 975 

Leu Ser Val Lys Gly Leu Leu Val Gly Glu He Leu Ser Ala Val Leu 

980 985 990 

Ser Gin Glu Gly He Asn He Leu Thr His Leu Pro Lys Gly Ser Ala 

995 1000 1005 

Glu Ala Glu Leu Met Ser Val Val Pro Val Phe Tyr Val Phe His Tyr 

1010 1015 1020 

Leu Glu Thr Gly Asn His Trp Asn He Phe His Ser Asp Pro Leu He 
025 1030 1035 1040 

Glu Lys Gin Lys Leu Lys Lys Lys Leu Lys Glu Gly Met Leu Ser He 

1045 1050 1055 

Met Ser Tyr Arg Asn Ala Asp Tyr Ser Tyr Ser Val Trp Lys Gly Gly 

1060 1065 1070 

Ser Ala Ser Thr Trp Leu Thr Ala Phe Ala Leu Arg Val Leu Gly Gin 

1075 1080 1085 

Val Asn Lys Tyr Val Glu Gin Asn Gin Asn Ser He Cys Asn Ser Leu 

1090 1095 1100 

Leu Trp Leu Val Glu Asn Tyr Gin Leu Asp Asn Gly Ser Phe Lys Glu 
105 1110 1115 1120 

Asn Ser Gin Tyr Gin Pro He Lys Leu Gin Gly Thr Leu Pro Val Glu 

112S 1130 1135 

Ala Arg Glu Asn Ser Leu Tyr Leu Thr Ala Phe Thr Val He Gly He 

1140 1145 1150 

Arg Lys Ala Phe Asp He Cys Pro Leu Val L He Asp Thr Ala Leu 

1155 1160 v 1165 

He Lys Ala Asp Asn Phe Leu Leu Glu Asn Thr Leu Pro Ala Gin Ser 

1170 1175 1180 

Thr Phe Thr Leu Ala He Ser Ala Tyr Ala Leu Ser Leu Gly Asp Lys 
185 1190 1195 1200 
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Thr His Pro Gin Phe Arg Ser He Val Ser Ala Leu Lys Arg Glu Ala 

1205 1210 1215 

Leu Val Lys Gly Asn Pro Pro He Tyr Arg Phe Trp Lys Asp Asn Leu 

1220 1225 1230 

Gin His Lys Asp Ser Ser Val Pro Asn Thr Gly Thr Ala Arg Met Val 

1235 1240 1245 

Glu Thr Thr Ala Tyr Ala Leu Leu Thr Ser Leu Asn Leu Lys Asp He 

1250 1255 1260 

Asn Tyr Val Asn Pro Val He Lys Trp Leu Ser Glu Glu Gin Arg Tyr 
265 1270 1275 1280 

Gly Gly Gly Phe Tyr Ser Thr Gin Asp Thr He Asn Ala He Glu Gly 

1285 1290 1295 

Leu Thr Glu Tyr Ser Leu Leu Val Lys Gin Leu Arg Leu Ser Met Asp 

1300 1305 1310 

He Asp Val Ser Tvr Lys His Lys Gly Ala Leu His Asn Tyr Lys Met 

1315 " 1320 1325 

Thr Asp Lys Asn Phe Leu Gly Arg Pro Val Glu Val Leu Leu Asn Asp 

1330 1335 1340 

Asp Leu He Val Ser Thr Gly Phe Gly Ser Gly Leu Ala Thr Val His 
345 1350 1355 1360 

Val Thr Thr Val Val His Lys Thr Ser Thr Ser Glu Glu Val Cys Ser 

1365 1370 1375 

Phe Tyr Leu Lys He Asp Thr Gin Asp He Glu Ala Ser His Tyr Arg 

1380 1385 1390 

Gly Tyr Gly Asn Ser Asp Tyr Lys Arg He Val Ala Cys Ala Ser Tyr 

1395 1400 1405 

Lys Pro Ser Arg Glu Glu Ser Ser Ser Gly Ser Ser His Ala Val Met 

1410 1415 1420 

Asp He Ser Leu Pro Thr Gly He Ser Ala Asn Glu Glu Asp Leu Lys 
425 1430 1435 1440 

Ala Leu Val Glu Gly Val Asp Gin Leu Phe Thr Asp Tyr Gin He Lys 

1445 1450 1455 

Asp Gly His Val He Leu Gin Leu Asn Ser He Pro Ser Ser Asp Phe 

1460 1465 1470 

Leu Cys Val Arg Phe Arg He Phe Glu Leu Phe Glu Val Gly Phe Leu 

1475 1480 1485 

Ser Pro Ala Thr Phe Thr Val Tyr Glu Tyr His Arg Pro Asp Lys Gin 

1490 1495 1500 

Cys Thr Met Phe Tyr Ser Thr Ser Asn He Lys He Gin Lys Val Cys 
505 1510 1515 1520 

Glu Gly Ala Ala Cvs Lys Cys Val Glu Ala Asp Cys Gly Gin Met Gin 

1525 1530 1535 

Glu Glu Leu Asp Leu Thr He Ser Ala Glu Thr Arg Lys Gin Thr Ala 

1540 1545 1550 

Cys Lys Pro Glu He Ala Tyr Ala Tyr Lys Val Ser He Thr Ser lie 

1555 1560 1565 

Thr Val Glu Asn Val Phe Val Lys Tyr Lys Ala Thr Leu Leu Asp He 

1570 1575 1580 

Tyr Lys Thr Gly Glu Ala Val Ala Glu Lys Asp Ser Glu He Thr Phe 
585 1590 1595 1600 

He Lys Lys Val Thr Cys Thr Asn Ala Glu Leu Val Lys Gly Arg Gin 

1605 1610 1615 

Tyr Leu He Met Gly Lys Glu Ala Leu Gin He Lys Tyr Asn Phe Ser 

1620 1625 " 1630 

Phe Arg Tyr He Tyr Pro Leu Asp Ser Leu Thr Trp He Glu Tyr Trp 

1635 1640 1645 

Pro Arg Asp Thr Thr Cys Ser Ser Cys Gin Ala Phe Leu Ala Asn Leu 

1650 * 1655 " 1660 

Asp Glu Phe Ala Glu Asp He Phe Leu Asn Gly Cys 
665 1670 1675 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 amino acids 

(B) TYPE: amino acid 

<C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
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(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:12: 



Met 


Gly 


Leu 


Trp 


l 








Trp 


Gly 


Gin 


Glu 








20 


Val 


Gly 


Ser 


Ser 






35 




Ala 


Phe 


Asp 


Ala 




50 






Thr 


Phe 


Ser 


Ser 


65 








Asn 


Ala 


Ala 


Leu 


Ser 


Pro 


Val 


Ser 








100 


Lys 


Ser 


Lys 


Lys 






115 




His 


Thr 


Asp 


Lvs 




130 






Val 


Tyr 


Ser 


Leu 


145 








Leu 


Thr 


Phe 


He 


Asn 


Asp 


Tyr 


Thr 








180 


Asn 


Pro 


Lys 


Tyr 






195 






Thr 


Thr 


Thr 




210 






Pro 


Arg 


Phe 


Ser 


225 








Lys 


Asn 


Phe 


Lys 


Asn 


Lys 


Val 


Val 








260 


Glu 


Asp 


He 


Lys 






275 




Ala 


A "I ■= 
Aid 


Lys 






290 






Thr 


Ala 


Val 


Lys 


305 








Lys 


Tyr 


Leu 


Tyr 


Ser 


Glu 


Glu 


Ala 








340 


Thr 


Leu 


Asn 


Leu 






355 




rfie 


Ser 


lie 


Lys 




370 






Gly 


Val 


Pro 


Val 


385 








Thr 


Ser 


Asp 


Leu 


Val 


Ala 


Val 


Phe 








420 


Phe 


Glu 


He 


Arg 






435 




Ser 


Lys 


Glu 


Tyr 




450 






He 


Tyr 


He 


Ala 


465 








Tyr 


Leu 


Asn 


He 


Thr 


His 


Tyr 


Asn 



Gly 


He 


Leu 


Cys 


5 








Gin 


Thr 


Tyr 


Val 


Glu 


Asn 


val 


Val 








40 


Thr 


Leu 


Ser 


Leu 






55 




Glv 


Tvr 


Val 


Asn 




70 






Leu 


Thr 


Leu 


Gin 


85 








His 


Val 


Tyr 


Leu 


He 


Pro 


lie 


Thr 








120 


Pro 


Val 


Tyr 


Thr 






135 




Glv 


Asp 


Asp 


Leu 




150 






Asp 


Pro 


Glu 


Gly 


165 








Gly 


He 


He 


Ser 


Gly 


val 


Trp 


Thr 








200 


Gly 


Thr 


Ala 


Tyr 






215 




Val 


Ser 


He 


Glu 




230 






Asn 


Phe 


Glu 


He 


245 








Pro 


Asp 


Ala 


Glu 


Asp 


Glu 


Glu 


Lys 








280 


Val 


Asp 


Gly 


Val 






295 




OX li 




Car 
OCX 






310 






He 


Ala 


Val 


Thr 


325 








Glu 


He 


Pro 


Gly 


Val 


Ala 


Thr 


Pro 








360 


Ala 


Gin 


Val 


Lys 






375 




Thr 


Leu 


Met 


Ala 




390 






Glu 


Thr 


Lys 


Arg 


405 








Val 


Leu 


Asn 


Leu 


Thr 


Asp 


Asp 


Pro 








440 


Glu 


Ala 


Val 


Ala 






455 




Trp 


Thr 


Glu 


Asn 




470 






Met 


val 


Thr 


Pro 


485 








Tyr 


Leu 


He 


Leu 



Leu 


Leu 


He 


Phe 




10 






He 


Ser 


Ala 


Pro 


25 








He 


Gin 


Val 


His 


Lys 


Ser 


Tyr 


Pro 








60 


Leu 


Ser 


Pro 


GlU 






75 




Pro 


Asn 


Gin 


Val 




90 






Glu 


Val 


Val 


Ser 


105 








Tyr 


Asn 


Asn 


Gly 


Pro 


Asp 


Gin 


Ser 








140 


Lys 


Pro 


Ala 


Lys 






155 




C tfi 
OCX 


U1U 


VAI 


AOL/ 




170 






Phe 


Pro 


Asp 


Phe 


185 








He 


Lys 


Ala 


Asn 


Phe 


Glu 


He 


Lys 








220 


Leu 


Glu 


Arg 


Thr 






235 




Thr 


Vol 


T \r<z 






250 






Val 


Tyr 


Ala 


Phe 


265 








Gin 


Met 


Met 


His 


Ala 


Gin 


He 


Ser 








300 


Asn 


Ser 


Leu 


Glu 






315 




vai 


xnr 


Pin 


ser 




330 






Val 


Lys 


Tyr 


Val 


345 








Leu 


Phe 


Val 


Lys 


Asp 


Ser 


Leu 


Glu 








380 


Gin 


Thr 


Val 


Asp 






395 




Ser 


He 


Thr 


His 




410 






Pro 


Ser 


Asn 


Val 


425 








Glu 


Leu 


Pro 


Glu 


Tyr 


Ser 


Ser 


Leu 








460 


Tyr 


Lys 


Pro 


Met 






475 




Lys 


Ser 


Pro 


Tyr 




490 






Ser 


Lys 


Gly 


Lys 



Leu Asp Lvs Thr 
15 

Lys He Leu Arg 
30 

Gly Tyr Thr Glu 
45 

Asp Lys Lys Val 

Asn Lys Phe Gin 
80 

Pro Arg Glu Glu 
95 

Lys His Phe Ser 
110 

He Leu Phe He 
125 

Val Lys He Arg 

Arg Glu Thr Val 
160 

He Val Glu Glu 
175 

Lys He Pro Ser 
190 

Tyr Lys Lys Asp 
205 

Glu Tyr Val Leu 

Phe He Gly Tyr 
240 

Arg Tyr Phe Tyr 
255 

Phe Gly Leu Arg 
270 

Lys Ala Thr Gin 
285 

Phe Asp Ser Glu 

Asp Leu Asn Asn 
320 

Ser Gly Gly Phe 
335 

Leu Ser Pro Tyr 
350 

Pro Gly He Pro 
365 

Gin Ala Val Gly 

Val Asn Gin Glu 
400 

Asp Thr Asp Gly 
415 

Thr Val Leu Lys 
430 

Glu Asn Gin Ala 
445 

Ser Gin Ser Tyr 

Leu Val Gly Glu 
480 

He Asp Lys He 
495 

He Val Gin Tyr 
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500 505 510 

Gly Thr Arg Glu Lys Leu Phe Ser Ser Thr Tyr Gin Asn lie Asn lie 

515 520 525 

Pro Val Thr Gin Asn Met Val Pro Ser Ala Arg Leu Leu Val Tyr Tyr 
530 535 540 

* lie Val Thr Gly Glu Gin Thr Ala Glu Leu Val Ala Asp Ala Val Trp 
545 550 555 560 

lie Asn lie Glu Glu Lys Cys Glv Asn Gin Leu Gin Val His Leu Ser 

565 " 570 575 

Pro Asp Glu Tyr Val Tyr Ser Pro Gly Gin Thr Val Ser Leu Asp Met 

580 585 590 

Val Thr Glu Ala Asp Ser Trp Val Ala Leu Ser Ala Val Asp Arg Ala 

595 " 600 605 

Val Tyr Lys Val Gin Gly Asn Ala Lys Arg Ala Met Gin Arg Val Phe 

610 615 620 

Gin Ala Leu Asp Glu Lys Ser Asp Leu Gly Cys Gly Ala Gly Gly Gly 
625 630 635 640 

His Asp Asn Ala Asp Val Phe His Leu Ala Gly Leu Thr Phe Leu Thr 

645 650 655 

Asn Ala Asn Ala Asp Asp Ser His Tyr Arg Asp Asp Ser Cys Lys Glu 

660 665 670 

lie Leu Arg Ser Lys Arg Asn Leu His Leu Leu Arg Gin Lys lie Glu 

675 680 685 

Glu Gin Ala Ala Lys Tyr Lys His Ser Val Pro Lys Lys Cys Cys Tyr 

690 695 700 

Asp Gly Ala Arg Val Asn Phe Tyr Glu Thr Cys Glu Glu Arg Val Ala 
705 710 715 720 

Arg Val Thr lie Gly Pro Leu Cys lie Arg Ala Phe Asn Glu Cys Cys 

725 730 735 

Thr lie Ala Asn Lys lie Arg Lys Glu Ser Pro His Lys Pro Val Gin 

740 745 750 

Leu Gly Arg He His He Lys Thr Leu Leu Pro Val Met Lys Ala Asp 

755 760 765 

He Arg Ser Tyr Phe Pro Glu Ser Trp Leu Trp Glu He His Arg Val 

770 775 780 

Pro Lys Arg Lys Gin Leu Gin Val Thr Leu Pro Asp Ser Leu Thr Thr 
785 790 795 800 

Trp Glu He Gin Gly He Gly He Ser Asp Asn Gly He Cys Val Ala 

805 810 815 

Asp Thr Leu Lys Ala Lys Val Phe Lys Glu Val Phe Leu Glu Met Asn 

820 825 830 

He Pro Tyr Ser Val Val Arg Gly Glu Gin He Gin Leu Lys Gly Thr 

835 840 845 

Val Tyr Asn Tyr Met Thr Ser Gly Thr Lys Phe Cys Val Lys Met Ser 

850 855 860 

Ala Val Glu Gly He Cys Thr Ser Gly Ser Ser Ala Ala Ser Leu His 
865 870 875 880 

Thr Ser Arg Pro Ser Arg Cys Val Phe Gin Arg He Glu Gly Ser Ser 

885 890 895 

Ser His Leu Val Thr Phe Thr Leu Leu Pro Leu Glu He Gly Leu His 

900 905 910 

Ser He Asn Phe Ser Leu Glu Thr Ser Phe Gly Lys Asp He Leu Val 

915 920 925 

Lys Thr Leu Arg Val Val Pro Glu Gly Val Lys Arg Glu Ser Tyr Ala 

930 935 " 940 

Gly Val He Leu Asp Pro Lys Gly He Arg Gly He Val Asn Arg Arg 
945 950 955 960 

Lys Glu Phe Pro Tyr Arg He Pro Leu Asp Leu Val Pro Lys Thr Lys 

965 970 975 

Val Glu Arg He Leu Ser Val Lys Gly Leu Leu Val Gly Glu Phe Leu 

980 985 990 

Ser Thr Val Leu Ser Lys Glu Gly He Asn He Leu Thr His Leu Pro 

995 1000 1005 

Lys Gly Ser Ala Glu Ala Glu Leu Met Ser 11° Ala Pro Val Phe Tyr 

1010 1015 1020 

Val Phe His Tyr Leu Glu Ala Gly Asn His Tr; Asn He Phe Tyr Pro 
025 1030 1035 1040 

Asp Thr Leu Ser Lys Arg Gin Ser Leu Glu Lys Lys He Lys Gin Gly 

1045 1050 1055 

Val Val Ser Val Met Ser Tyr Arg Asn Ala Asp Tyr Ser Tyr Ser Met 
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1060 1065 1070 

Trp Lys Gly Ala Ser Ala Ser Thr Trp Leu Thr Ala Phe Ala Leu Arg 

1075 1080 1085 

Val Leu Gly Gin Val Ala Lys Tyr Val Lys Gin Asp Glu Asn Ser lie 

1090 1095 1100 

Cys Asn Ser Leu Leu Trp Leu Val Glu Lys Cys Gin Leu Glu Asn Gly 
105 1110 1115 1120 

Ser Phe Lys Glu Asn Ser Gin Tyr Leu Pro lie Lys Leu Gin Gly Thr 

1125 1130 1135 

Leu Pro Ala Glu Ala Gin Glu Lys Thr Leu Tyr Leu Thr Ala Phe Ser 

1140 1145 1150 

Val He Gly He Arg Lys Ala Val Asp He Cys Pro Thr Met Lys He 

1155 1160 1165 

His Thr Ala Leu Asp Lys Ala Asp Ser Phe Leu Leu Glu Asn Thr Leu 

1170 1175 1180 

Pro Ser Lys Ser Thr Phe Thr Leu Ala He Val Ala Tyr Ala Leu Ser 
185 H90 1195 1200 

Leu Gly Asp Arg Thr His Pro Arg Phe Arg Leu He Val Ser Ala Leu 

1205 1210 1215 

Arg Lys Glu Ala Phe Val Lys Gly Asp Pro Pro He Tyr Arg Tyr Trp 

1220 1225 1230 

Arg Asp Thr Leu Lys Arg Pro Asp Ser Ser Val Pro Ser Ser Gly Thr 

1235 1240 1245 

Ala Gly Met Val Glu Thr Thr Ala Tyr Ala Leu Leu Ala Ser Leu Lys 

1250 1255 1260 

Leu Lys Asp Met Asn Tyr Ala Asn Pro He He Lys Trp Leu Ser Glu 
265 1270 1275 1280 

Glu Gin Arg Tyr Gly Gly Gly Phe Tyr Ser Thr Gin Asp Thr He Asn 

1285 1290 1295 

Ala He Glu Gly Leu Thr Glu Tyr Ser Leu Leu Leu Lys Gin He His 

1300 1305 1310 

Leu Asp Met Asp He Asn Val Ala Tyr Lys His Glu Gly Asp Phe His 

1315 1320 1325 

Lys Tyr Lys Val Thr Glu Lys His Phe Leu Gly Arg Pro Val Glu Val 

1330 1335 1340 

Ser Leu Asn Asp Asp Leu Val Val Ser Thr Gly Tyr Ser Ser Gly Leu 
345 1350 1355 1360 

Ala Thr Val Tyr Val Lys Thr Val Val His Lys He Ser Val Ser Glu 

1365 1370 1375 

Glu Phe Cys Ser Phe Tyr Leu Lys He Asp Thr Gin Asp He Glu Ala 

1380 1385 1390 

Ser Ser His Phe Arg Leu Ser Asp Ser Gly Phe Lys Arg He He Ala 

1395 1400 1405 

Cys Ala Ser Tyr Lys Pro Ser Lys Glu Glu Ser Thr Ser Gly Ser Ser 

1410 1415 1420 

His Ala Val Met Asp He Ser Leu Pro Thr Gly He Gly Ala. Asn Glu 
425 1430 1435 1440 

Glu Asp Leu Arg Ala Leu Val Glu Gly Val Asp Gin Leu Leu Thr Asp 

1445 1450 1455 

Tyr Gin He Lys Asp Gly His Val He Leu Gin Leu Asn Ser He Pro 

1460 1465 1470 

Ser Arg Asp Phe Leu Cys Val Arg Phe Arg He Phe Glu Leu Phe Gin 

1475 1480 1485 

Val Gly Phe Leu Asn Pro Ala Thr Phe Thr Val Tyr Glu Tyr His Arg 

1490 1495 1500 

Pro Asp Lys Gin Cys Thr Met He Tyr Ser He Ser Asp Thr Arg Leu 
505 1510 1515 1520 

Gin Lys Val Cys Glu Gly Ala Ala Cys Thr Cys Val Glu Ala Asp Cys 

1525 1530 1535 

Ala Gin Leu Gin Ala Glu Val Asp Leu Ala He Ser Ala Asp Ser Arg 

1540 1545 1550 

Lys Glu Lys Ala Cys Lys Pro Glu Thr Ala Tyr Ala Tyr Lys val Arg 

1555 1560 1565 

He Thr Ser Ala Thr Glu Glu Asn Val Phe Val Lys Tyr Thr Ala Thr 

1570 1575 1580 

Leu Leu Val Thr Tyr Lys Thr Gly Glu Ala Ala Asp Glu Asn Ser Glu 
585 1590 1595 1600 

Val Thr Phe He Lys Lys Met Ser Cys Thr Asn Ala Asn Leu Val Lys 

1605 1610 1615 

Gly Lys Gin Tyr Leu He Met Gly Lys Glu Val Leu Gin He Lys His 
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1620 1625 1630 

Asn Phe Ser Phe Lys Tyr lie Tyr Pro Leu Asp Ser Ser Thr Trp He 

1635 1640 1645 

Glu Tyr Trp Pro Thr Asp Thr Thr Cys Pro Ser Cys Gin Ala Phe Val 

1650 "l655 1660 

Glu Asn Leu Asn Asn Phe Ala Glu Asp Leu Phe Leu Asn Ser Cys Glu 
665 1670 1675 1680 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1235 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Met Phe 


Ser 


Gly 


1 

Ala Arg 


Ala 


Ala 






20 


Gly Arg 


Gly 


Pro 




35 




Leu Ala 


Pro 


Val 


50 






His Thr 


Tyr 


Tyr 


65 






Val Leu 


Asp 


Glu 


Gly His 


Leu 


Lys 






100 


Asp Val 


Leu 


Arg 




115 




Leu Trp 


Gly 


Gly 


130 






Thr Val 


Phe 


His 


145 






Gly Met 


Arg 


Ala 


Pro Thr 


Gly 


Thr 






180 


Arg Val 


Ala 


Val 




195 




Lys Glu 


Glu 


Val 


210 






Cys Glu 


Arg 


Met 


225 






Arg Gly 


He 


Ser 


Asp Val 


Tyr 


Tyr 






260 


Val Arg 


Ser 


Gly 




275 




Ala He 


Lys 


Lys 


290 






Leu Asp 


Asn 


Pro 


305 






Gly Arg 


Asn 


Asn 


Thr Ser 


Ser 


Asp 






340 


Glu Gly 


Gly 


Met 




355 





Gly 


Gly 


Gly 


Pro 


5 








Ser 


Gly 


Phe 


Phe 


Pro 


Pro 


Cys 


Leu 








40 


Gly 


Thr 


Gin 


Gin 






55 




Ser 


Glu 


Cys 


Asp 




70 






Asp 


Ala 


Pro 


Pro 


85 








Arg 


Ala 


Pro 


Lys 


Val 


Gly 


Ser 


Gly 








120 


Val 


Asp 


His 


Ala 






135 




Val 


Tvr 


Asp 


lie 




150 






Ala 


Gin 


Phe 


His 


165 








Val 


He 


Thr 


Leu 


His 


Val 


Tyr 


Gly 








200 


Asp 


Arg 


His 


Leu 






215 




Ala 


Ala 


Ala 


Leu 




230 






Ala 


Asp 


His 


Phe 


245 








Tyr 


Glu 


Thr 


Arg 


Arg 


Val 


Leu 


Ser 








280 


Tyr 


Glu 


Gly 


Gly 






295 




Gly 


Phe 


Val 


Thr 




310 






Thr 


Leu 


Ala 


Gin 


325 








Val 


Glu 


Phe 


Asn 


Ser 


Asp 


Leu 


Pro 








360 



Leu 


Ser Pro 


Gly 




10 




Ala 


Pro Ala 


Gly 


25 






Arg 


Gin Asn 


Phe 


Lys 


Pro Thr 


Gly 






60 


Glu 


Phe Arg 


Phe 




75 




Glu 


Lys Arg 


Ala 




90 




Val 


Tyr Cys 


Gly 


105 






Gly 


Phe Trp 


Pro 


Pro 


Ala Gly 


Phe 






140 


Leu 


Glu Asn 


Val 




155 




Ala 


Arg Phe 


Met 




170 




Leu 


Gly Leu 


Thr 


185 






Thr 


Arg Gin 


Tyr 


Gin 


Cys Arg 


Ala 






220 


Arg 


Glu Ser 


Pro 




235 




Glu 


Ala Glu 


Val 




250 




Pro 


Ala Leu 


Phe 


265 






Tyr 


Leu Cys 


Asp 


Val 


Asp Ala 


Thr 






300 


Phe 


Gly Trp 


Tyr 




315 




Pro 


Arg Ala 


Pro 




330 




Cvs 


Thr Ala 


Asp 


345 






Ala 


Tyr Lys 


Leu 



Gly Lys Ser Ala 
15 

Pro Arg Gly Ala 
30 

Tyr Asn Pro Tyr 
45 

Pro Thr Gin Arg 

He Ala Pro Arg 
80 

Gly Val His Asp 
95 

Gly Asp Glu Arg 
110 

Arg Arg Ser Arg 
125 

Asn Pro Thr Val 

Glu His Ala Tyr 
160 

Asp Ala He Thr 
175 

Pro Glu Gly His 
190 

Phe Tyr Met Asn 
205 

Pro Arg Asp Leu 

Gly Ala Ser Phe 
240 

Val Glu Arg Thr 
255 

Tyr Arg Val Tyr 
270 

Asn Phe Cys Pro 
285 

Thr Arg Phe He 

Arg Leu Lys Pro 
320 

Met Ala Phe Gly 
335 

Asn Leu Ala He 
350 

Met Cys Phe Asp 
365 
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lie Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val Ala 

370 375 380 

Gly His Pro Glu Asp Leu Val lie Gin lie Ser Cys Leu Leu Tyr Asp 
385 390 395 400 

Leu Ser Thr Thr Ala Leu Glu His Val Leu Leu Phe Ser Leu Gly Ser 

405 410 415 

Cys Asp Leu Pro Glu Ser His Leu Asn Glu Leu Ala Ala Arg Gly Leu 

420 425 430 

Pro Thr Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu Leu 

435 440 445 

Ala Phe Met Thr Leu Val Lys Gin Tyr Gly Pro Glu Phe Val Thr Gly 

450 455 460 

Tyr Asn lie lie Asn Phe Asp Trp Pro Phe Leu Leu Ala Lys Leu Thr 
465 470 475 480 

Asp He Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly Arg 

485 490 495 

Gly Val Phe Arg Val Trp Asp He Gly Gin Ser His Phe Gin Lys Arg 

500 505 510 

Ser Lys He Lys Val Asn Gly Met Val Asn He Asp Met Tyr Gly He 

515 520 525 

He Thr Asp Lys He Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val Ala 

530 535 540 

Glu Ala Val Leu Lys Asp Lys Lys Lys Asp Leu Ser Tyr Arg Asp He 
545 550 555 560 

Pro Ala Tyr Tyr Ala Thr Gly Pro Ala Gin Arg Gly Val lie Gly Glu 

565 570 575 

Tyr Cys He Gin Asp Ser Leu Leu Val Gly Gin Leu Phe Phe Lys Phe 

580 585 590 

Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly He Asn 

595 600 605 

He Thr Arg Thr He Tyr Asp Gly Gin Gin He Arg Val Phe Thr Cys 

610 615 620 

Leu Leu Arg Leu Ala Asp Gin Lys Gly Phe He Leu Pro Asp Thr Gin 
625 630 635 640 

Gly Arg Phe Arg Gly Ala Gly Gly Glu Ala Pro Lys Arg Pro Ala Ala 

645 650 655 

Ala Arg Glu Asp Glu Glu Arg Pro Glu Glu Glu Gly Glu Asp Glu Asp 

660 665 670 

Glu Arg Glu Glu Gly Gly Gly Glu Arg Glu Pro Glu Gly Ala Arg Glu 

675 680 685 

Thr Ala Gly Arg His Val Gly Tyr Gin Gly Ala Lys Val Leu Asp Pro 

690 695 700 

Thr Ser Gly Phe His Val Asn Pro Val Val Val Phe Asp Phe Ala Ser 
705 710 715 720 

Leu Tyr Pro Ser He lie Gin Ala His Asn Leu Cys Phe Ser Thr Leu 

725 730 735 

Ser Leu Arg Ala Asp Ala Val Ala His Leu Glu Ala Gly Lys Asp Tyr 

740 " 745 750 

Leu Glu He Glu Val Gly Gly Arg Arg Leu Phe Phe Val Lys Ala His 

755 760 765 

Val Arg Glu Ser Leu Leu Ser He Leu Leu Arg Asp Trp Leu Ala Met 

770 775 780 

Arg Lys Gin He Arg Ser Arg He Pro Gin Ser Ser Pro Glu Glu Ala 
785 790 795 800 

Val Leu Leu Asp Lys Gin Gin Ala Ala He Lys Val Val Cys Asn Ser 

805 810 815 

Val Tyr Gly Phe Thr Gly Val Gin His Gly Leu Leu Pro Cys Leu His 

820 825 830 

Val Ala Ala Thr Val Thr Thr He Gly Arg Glu Met Leu Leu Ala Thr 

835 840 645 

Arg Glu Tyr Val His Ala Arg Trp Ala Ala Phe Glu Gin Leu Leu Ala 

850 855 860 

Asp Phe Pro Glu Ala Ala Asp Met Arg Ala Pro Gly Pro Tyr Ser Met 
865 870 875 880 

Arg He He Tyr Gly Asp Thr Asp Ser He Phe Val Leu Cys Arg Gly 

885 890 895 

Leu Thr Ala Ala Gly Leu Thr Ala Met Gly Asp Lys Met Ala Ser His 

900 905 910 

He Ser Arg Ala Leu Phe Leu Pro Pro lie Lys Leu Glu Cys Glu Lys 
915 920 925 
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Thr 


Lys 


Leu 


Leu 


Leu 


He 


Ala 


Lys 


Lys 


Lys 


Tyr 


He Gly 


Val 


930 








935 










94 0 




Val Arg 




lie Tyx 


Gly 


Gly 


Lys 


Met 


Leu 


He 


Lys 


Gly 


Val 


Asp 


Leu 


Lys 


945 






950 










955 








960 


Asn Asn 


Cvs 


Ala 


Phe 


He 


Asn 


Arg 


Thr 


Ser 


Arg 


Ala 


Leu 


Val Asp 


Leu 






965 










970 








975 




Leu phe 


Tvr 


Asd 


Asp 


Thr 


Val 


Ser 


Gly 


Ala 


Ala 


Ala 


Ala 


Leu Ala 


Glu 




980 










985 










990 




Arg Pro 


Ala 


Glu 


Glu 


Trp 


Leu 


Ala 


Arg 


Pro 


Leu 


Pro 


Glu 


Gly Leu 


Gin 


995 








1000 








1005 






Ala Phe 


Gly 


Ala 


Val 


Leu 


Val 


Asp 


Ala 


His 


Arg Arg 


He 


Thr Asp 


Pro 


1010 






1015 








1020 










Asp 


He 


Gin 


Asp 


Phe 


Val 


Leu 


Thr 


Ala 


Glu 


Leu 


Ser Arg 


His 


025 




1030 








1035 






1040 


Pro Arg 


Ala 


Tyr 


Thr 


Asn 


Lys 


Arg 


Leu 


Ala 


His 


Leu 


Thr 


Val Tyr Tyr 




L045 








1050 








10S5 




Lys Leu 


Met 


Ala Arg Arg 


Ala 


Gin 


Val 


Pro 


Ser 


lie 


Lys 


Asp Arg 


He 


1060 








1065 








1070 




Pro Tyr 

1 


Val 


He 


Val 


Ala 


Gin 


Thr Arg Glu Val 


Glu 


Glu 


Thr Val 


Ala 


L075 








1080 








1085 







Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala Ala Ala Pro Gly Asp Glu 

1090 1095 1100 

Pro Ala Pro Pro Ala Ala Leu Pro Ser Pro Ala Lys Arg Pro Arg Glu 
105 1H0 1115 1120 

Thr Pro Ser His Ala Asp Pro Pro Gly Gly Ala Ser Lys Pro Arg Lys 

1125 1130 1135 

Leu Leu Val Ser Glu Leu Ala Glu Asp Pro Ala Tyr Ala He Ala His 

1140 H45 1150 

Gly Val Ala Leu Asn Thr Asp Tyr Tyr Phe Ser His Leu Leu Gly Ala 

1155 1160 1165 

Ala Cys Val Thr Phe Lys Ala Leu Phe Gly Asn Asn Ala Lys He Thr 

1170 1175 1180 

Glu Ser Leu Leu Lys Arg Phe He Pro Glu Val Trp His Pro Pro Asp 
IBS 1190 1195 1200 

Asp Val Ala Ala Arg Leu Arg Ala Ala Gly Phe Gly Ala Val Gly Ala 

1205 1210 1215 

Gly Ala Thr Ala Glu Glu Thr Arg Arg Met Leu His Arg Ala Phe Asp 
1220 1225 1230 

Thr Leu Ala 
1235 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 





<vi) ORIGINAL 


SOURCE : 


















(xiJ SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 14: 






Met 


Phe 


Cys 


Ala 


Ala 


Gly 


Gly 


Pro 


Ala 


Ser 


Pro 


Gly Gly 


Lys Ser 


Ala 


1 






5 










10 






15 




Ala 


Arg 


Ala 


Ala 


Ser 


Gly 


Phe 


Phe 


Ala 


Pro 


His 


Asn Pro 


Arg Gly 


Ala 






20 










25 








30 




Thr 


Gin 


Thr 


Ala 


Pro 


Pro 


Pro 


Cys 


Arg 


Arg 


Gin 


Asn Phe 


Tyr Asn 


Pro 






35 










40 








45 






His 


Leu 


Ala 


Gin 


Thr 


Gly 


Thr 


Gin 


Pro 


Lys 


Ala 


Pro Gly 


Pro Ala 


Gin 




50 








55 










60 






Arg 


His 


Thr 


Tyr 


Tyr 


Ser 


Glu 


Cys 


Asp 


Glu 


Phe 


Arg Phe 


He Ala 


Pro 


65 








70 










75 






80 


Arg 


Ser 


Leu 


Asp 


Glu 


Asp 


Ala 


Pro 


Ala 


Glu 


Gin 


Arg Thr 


Gly Val 


His 






85 










90 






95 




Asp 


Gly 


Arg 


Leu 


Arg 


Arg 


Ala 


Pro 


Lys 


Val 


Tyr 


Cys Gly 


Gly Asp Glu 
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100 105 110 

Arg Asp Val Leu Arg Val Gly' Pro Glu Gly Phe Trp Pro Arg Arg Leu 

115 120 125 

Arg Leu Trp Gly Gly Ala Asp His Ala Pro Glu Gly Phe Asp Pro Thr 

130 135 140 

Val Thr Val Phe His Val Tyr Asp He Leu Glu His Val Glu His Ala 
145 150 155 160 

Tyr Ser Met Arg Ala Ala Gin Leu His Glu Arg Phe Met Asp Ala He 

165 170 175 

Thr Pro Ala Gly Thr Val He Thr Leu Leu Gly Leu Thr Pro Glu Gly 

180 1B5 190 

His Arg Val Ala Val His Val Tyr Gly Thr Arg Gin Tyr Phe Tyr Met 

195 200 205 

Asn Lys Ala Glu Val Asp Arg His Leu Gin Cys Arg Ala Pro Arg Asp 

210 215 220 

Leu Cys Glu Arg Leu Ala Ala Ala Leu Arg Glu Ser Pro Gly Ala Ser 
225 230 235 240 

Phe Arg Gly He Ser Ala Asp His Phe Glu Ala Glu Val Val Glu Arg 

245 250 255 

Ala Asp Val Tyr Tyr Tyr Glu Thr Arg Pro Thr Leu Tyr Tyr Arg Val 

260 265 270 

Phe Val Arg Ser Gly Arg Ala Leu Ala Tyr Leu Cys Asp Asn Phe Cys 

275 280 285 

Pro Ala He Arg Lys Tyr Glu Gly Gly Val Asp Ala Thr Thr Arg Phe 

290 295 300 

He Leu Asp Asn Pro Gly Phe Val Thr Phe Gly Trp Tyr Arg Leu Lys 
305 310 315 320 

Pro Gly Arg Gly Asn Ala Pro Ala Gin Pro Arg Pro Pro Thr Ala Phe 

325 330 335 

Gly Thr Ser Ser Asp Val Glu Phe Asn Cys Thr Ala Asp Asn Leu Ala 

340 345 350 

Val Glu Gly Ala Met Cys Asp Leu Pro Ala Tyr Lys Leu Met Cys Phe 

355 360 365 

Asp He Glu Cys Lys Ala Gly Gly Glu Asp Glu Leu Ala Phe Pro Val 

370 375 380 

Ala Glu Arg Pro Glu Asp Leu Val He Gin He Ser Cys Leu Leu Tyr 
385 390 395 400 

Asp Leu Ser Thr Thr Ala Leu Glu His He Leu Leu Phe Ser Leu Gly 

405 410 415 

Ser Cys Asp Leu Pro Glu Ser His Leu Ser Asp Leu Ala Ser Arg Gly 

420 425 430 

Leu Pro Ala Pro Val Val Leu Glu Phe Asp Ser Glu Phe Glu Met Leu 

435 440 445 

Leu Ala Phe Met Thr Phe Val Lys Gin Tyr Gly Pro Glu Phe Val Thr 

450 455 460 

Gly Tyr Asn He He Asn Phe Asp Trp Pro Phe Val Leu Thr Lys Leu 
465 470 475 480 

Thr Glu He Tyr Lys Val Pro Leu Asp Gly Tyr Gly Arg Met Asn Gly 

485 490 495 

Arg Gly Val Phe Arg Val Trp Asp He Gly Gin Ser His Phe Gin Lys 

500 505 510 

Arg Ser Lys He Lys Val Asn Gly Met Val Asn He Asp Met Tyr Gly 

515 520 525 

He He Thr Asp Lys Val Lys Leu Ser Ser Tyr Lys Leu Asn Ala Val 

530 535 540 

Ala Glu Ala Val Leu Lvs Asp Lys Lvs Lys Asp Leu Ser Tyr Arg Asp 
545 550 555 560 

He Pro Ala Tyr Tyr Ala Ser Gly Pro Ala Gin Arg Gly Val He Gly 

565 570 575 

Glu Tyr Cys Val Gin Asp Ser Leu Leu Val Gly Gin Leu Phe Phe Lys 

580 585 590 

Phe Leu Pro His Leu Glu Leu Ser Ala Val Ala Arg Leu Ala Gly He 

595 600 605 

Asn He Thr Arg Thr He Tyr Asp Gly Gin Gin He Arg Val Phe Thr 

610 615 620 

Cys Leu Leu Arg Leu Ala Gly Gin Lys Gly Phe He Leu Pro Asp Thr 
625 630 ' 635 640 

Gin Gly Arg Phe Arg Gly Leu Asp Lys Glu Ala Pro Lys Arg Pro Ala 

645 650 655 

Val Pro Arg Gly Glu Gly Glu Arg Pro Gly Asp Gly Asn Gly Asp Glu 
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Arg Val Leu Asp Pro Thr Ser Gly Phe His Val Asp Pro Val Val Val 



710 



660 665 670 

Asp Lys Asp Asp Asp Glu Asp Gly Asp Glu Asp Gly Asp Glu Arg Glu 

675 680 685 

Glu Val Ala Arg Glu Thr Gly Gly Arg His Val Gly Tyr Gin Gly Ala 
690 695 700 

Phe Asp Phe Ala Ser Leu Tyr Pro Ser lie lie Gin Ala His Asn Leu 

725 730 35 

Cvs Phe Ser Thr Leu Ser Leu Arg Pro Glu Ala Val Ala His Leu Glu 

740 745 750 

Ala Asp Arg Asp Tvr Leu Glu He Glu Val Gly Gly Arg Arg Leu Phe 

75 1 ' 760 7 65 

Phe Val Lys Ala His Val Arg Glu Ser Leu Leu Ser lie Leu Leu Arg 

770 77 5 780 

Asp Trp Leu Ala Met Arg Lys Gin He Arg Ser Arg He Pro Gin Ser 

Pro Pro Glu Glu Ala Val Leu Leu Asp Lys Gin Gin Ala Ala lie Lys 

805 810 815 

Val Val Cys Asn Ser Val Tyr Gly Phe Thr Gly Val Gin His Gly Leu 

820 825 8 

Leu Pro Cys Leu His Val Ala Ala Thr Val Thr Thr lie Gly Arg Glu 

835 840 845 

Met Leu Leu Ala Thr Arg Ala Tyr Val His Ala Arg Trp Ala Glu Phe 

850 855 860 

Asp Gin Leu Leu Ala Asp Phe Pro Glu Ala Ala Gly Met Arg Ala Pro 
865 870 875 880 

Gly Pro Tyr Ser Met Arg He He Tyr Gly Asp Thr Asp Ser lie Phe 

885 890 895 

Val Leu Cys Arg Gly Leu Thr Gly Glu Ala Leu Val Ala Met Gly Asp 

900 905 9 

Lys Met Ala Ser His He Ser Arg Ala Leu Phe Leu Pro Pro He Lys 

1 915 920 925 

Leu Glu Cys Glu Lys Thr Phe Thr Lys Leu Leu Leu He Ala Lys Lys 

930 93 5 940 

He Gly Val He Cys Gly Gly Lys Met Leu 
945 950 955 960 

„ ... .__ Lys Asn Cvs Ala Phe He Asn Arg Thr Ser 

965 ' 970 975 



Lys Tyr He Gly Val He Cys Gly Gly Lys Met Leu He Lys Gly Val 
Qkc 950 955 960 

Asp Leu Val Arg Lys Asn Asn Cys Ala Phe He Asn Arg Thr Ser Arg 

965 970 975 

Ala Leu Val Asp Leu Leu Phe Tyr Asp Asp Thr val Ser Gly Ala Ala 

980 985 
Ala Ala Leu Ala Glu Arg Pro Ala Glu Glu Trp Leu Ala Arg Pro Leu 

995 1000 1005 

Pro Glu Gly Leu Gin Ala Phe Gly Ala Val Leu Val Asp Ala His Arg 

1010 1015 1020 

Arg lie Thr Asp Pro Glu Arg Asp He Gin Asp Phe Val Leu Thr Ala 
02I 1030 1035 iu * u 

Glu Leu Ser Arg His Pro Arg Ala Tyr Thr Asn Lys Arg Leu Ala Hxs 

1045 1050 J.us:> 

Leu Thr Val Tyr Tyr Lys Leu Met Ala Arg Arg Ala Gin Val Pro Ser 

1060 1065 10 /u 

He Lys Asp Arg He Pro Tyr Val lie Val Ala Gin Thr Arg Glu Val 

1075 1°80 1085 

Glu Glu Thr Val Ala Arg Leu Ala Ala Leu Arg Glu Leu Asp Ala Ala 

1090 1095 1100 

Ala Pro Gly Asp Glu Pro Ala Pro Pro Ala Ala Leu Pro Ser Pro Ala 

n _ 1110 111b AA^V 

Lys Arg Pro Arg Glu Thr Pro Ser His_Ala Asp Pro Pro Gly^Gly Ala 
Ser Lys Pro Argus' Leu Leu Val Se/L .Leu Ale Glu Asp Pro Gly 

1 140 1145 

Tyr Ala He Ala Arg Gly val Pro Leu Asn Thr Aspjyr Tyr Phe Ser 

His Leu^eu Gly Ala Ala Cys"" Thr Phe Lys Ala Leu Phe Gly Asn 

11*75 11B0 
Asn^Ia Lys He Thr Glu Ser Leu Leu Lys Arg Phe He Pro Glu Thr 
ipe 1190 1195 a*vv 

Trp His Pro Pro Asp Asp Val Ala Ala Arg Leu Arg Ala Ala Gly Phe 

1205 121° 121 

Gly Pro Ala Gly Ala Gly Ala Thr Ala Glu Glu Thr Arg Arg Met Leu 
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1220 1225 1230 

His Arg Ala Phe Asp Thr Leu Ala 
1235 1240 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1012 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
(Vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 



Met 


Asp 


Ser 


Val 


Ser 


Phe 


Phe 


Asn 


Pro 


Tyr Leu Glu 


Ala Asn Arg Leu 


1 








5 










10 




15 


Lys 


Lys 


Lys 


Ser 


Arg 


Ser 


Ser 


Tyr 


He 


Arg He 


Leu 


Pro Arg Gly He 








20 










25 






30 


Met 


His 


Asp 


Gly 


Ala 


Ala 


Gly 


Leu 


lie 


Lys Asp 


Val 


Cys Asp Ser Glu 






35 










40 








45 


Pro 


Arg 


Met 


Phe 


Tyr 


Arg 


Asp 


Arg 


Gin 


Tyr Leu 


Leu 


Ser Lys Glu Met 




50 










55 








60 




Thr 


Trp 


Pro 


Ser 


Leu 


Asp 


He 


Ala 


Arg 


Ser Lys 


Asp 


Tyr Asp His Met 


65 










70 








75 




60 


Arg 


Met 


Lys 


Phe 


His 


He 


Tyr 


Asp 


Ala 


Val Glu 


Thr 


Leu Met Phe Thr 










85 










90 




95 


Asp 


Ser 


He 


Glu 


Asn 


Leu 


Pro 


Phe 


Gin 


Tyr Arg 


His 


Phe Val He Pro 








100 










105 






110 


Ser 


Gly 


Thr 


Val 


He 


Arg 


Met 


Phe 


Gly 


Arg Thr 


Glu 


Asp Gly Glu Lys 






115 










120 








125 


He 


Cys 


Val 


Asn 


Val 


Phe 


Gly 


Gin 


Glu 


Gin Tyr Phe 


Tyr Cys Glu Cys 




130 










135 








140 




Val 


Asp 


Gly 


Arg 


Ser 


Leu 


Lys 


Ala 


Thr 


He Asn 


Asn 


Leu Met Leu Thr 


145 










150 








155 




160 


Gly 


Glu 


Val 


Lys 


Met 


Ser 


Cys 


Ser 


Phe 


Val He 


Glu 


Pro Ala Asp Lys 










165 










170 




175 


Leu 


Ser 


Leu 


Tyr 


Gly 


Tyr 


Asn 


Ala 


Asn 


Thr Val 


Val 


Asn Leu Phe Lys 








180 










185 






190 


Val 


Ser 


Phe 


Gly 


Asn 


Phe 


Tyr 


Val 


Ser 


Gin Arg 


He 


Gly Lys He Leu 






195 










200 








205 


Gin 


Asn 


Glu 


Gly 


Phe 


Val 


Val 


Tyr 


Glu 


He Asp 


Val 


Asp Val Leu Thr 




210 










215 








220 




Arg 


Phe 


Phe 


Val 


Asp 


Asn 


Gly 


Phe 


Leu 


Ser Phe 


Gly 


Trp Tyr Asn Val 


225 










230 








235 




240 


Lys 


Lys 


Tyr 


He 


Pro 


Gin 


Asp 


Met 


Gly 


Lys Gly 


Ser 


Asn Leu Glu Val 










245 










250 




255 


Glu 


He 


Asn 


Cys 


His 


Val 


Ser 


Asp 


Leu 


Val Ser 


Leu 


Glu Asp Val Asn 








260 










265 






270 


Trp 


Pro 


Leu 


Tyr 


Gly 


Cys 


Trp 


Ser 


Phe 


Asp He 


Glu 


Cys Leu Gly Gin 






275 










280 








285 


Asn 


Gly 


Asn 


Phe 


Pro 


Asp 


Ala 


Glu 


Asn 


Leu Gly Asp 


lie Val He Gin 




290 










295 








300 




He 


Ser 


Val 


He 


Ser 


Phe 


Asp 


Thr 


Glu 


Gly Asp Arg 


Asp Glu Arg His 


305 










310 








315 




320 


Leu 


Phe 


Thr 


Leu 


Gly 


Thr 


Cys 


Glu 


Lys 


He Asp 


Gly 


Val His He Tyr 










325 










330 




335 


Glu 


Phe 


Ala 


Ser 


Glu 


Phe 


Glu 


Leu 


Leu 


Leu Gly 


Phe 


Phe He Phe Leu 








340 










345 






350 


Arg 


He 


Glu 


Ser 


Pro 


Glu 


Phe 


He 


Thr 


Gly T\ 


Asn 


lie Asn Asn Phe 




355 










360 








365 


Asp 


Leu 


Lys 


Tyr 


Leu 


Cys 


He 


Arg 


Met 


Asp Lys 


He 


Tyr His Tyr Asp 




370 










375 








380 




He 


Gly 


Cys 


Phe 


Ser 


Lys 


Leu 


Lys 


Asn 


Gly Lys 


He 


Gly He Ser Val 


385 










390 








395 




400 
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Pro 


His 


Glu 


Gin 


Pne 


Tnr 


Ser 


uiy 








420 


Lys 


lie 


Thr 


Ala 






A 1 C 




Leu 


Gin 


Gin 


ulU 




450 






Pne 


lie 


Ser 


Giy 


4ob 








Gin 


Asp 


Ser 


vai 


Pne 


Glu 


val 


Ala 








500 


Val 


val 


Phe 


GlU 






515 




GlU 


Ala 


Lys 


Arg 




530 






Asn 


Arg 


Gin 


Gly 


545 








Thr 


Gly 


Tyr 


Tyr 


Tyr 


Pro 


Ser 


lie 






580 


Leu 


Asp 


Glu 


Arg 






595 




Val 


Lys 


Leu 


Gly 




610 






Glu 


Ser 


Val 


Leu 


625 








Glu 


Val 


Lys 


Ala 


Leu 


Leu 


Asp 


Lys 








660 


Tyr 


Gly 


Val 


Thr 






675 




Ala 


Ala 


Ser 


Val 




690 






Asp 


Tyr 


Val 


Asn 


•7 f\ e 

705 








Phe 


Gly 


Leu 


Thr 


lie 


Tyr 


Gly 


Asp 








740 


Asn 


Gin 


Ser 


Leu 






755 




Asp 


Arg 


Leu 


Phe 




770 






Cys 


Pro 


Leu 


lie 


785 








Asp 


Ser 


Leu 


Leu 


Cys 


Asp 


Phe 


Val 








820 


Phe 


Asp 


Glu 


Glu 






835 




Gin 


Thr 


Gin 


Leu 




850 






Leu 


Arg 


Arg 


Leu 


865 








Asp 


Val 


Arg 


His 


Ala 


Tyr 


Lys 


Gin 








900 


Ala 


Gin 


Arg 


Lys 






915 




Val 


Leu 


lie 


Ala 




930 






Leu 


Ala 


Glu 


Asp 



945 



Tyr 


Arg 


i»ys 


oiy 










Vox 


x*eu 


xyr 


T .91 1 


m n 


A en 


Tvr 


Lys 








440 


Lys 


Glu 


Gin 


Leu 






455 




Pro 


C A 




Arg 




a "7 n 






T At « 

Leu 


vai 


vai 


Arg 


HOD 








Glu 


val 


Aia 


Arg 


taiy 


Gin 


uin 


Lys 








520 


Arg 


Asn 


Met 


He 










■rl- 
IiS 


Gly 


Tyr 


Lys 




ccn 
jjU 






Ala 


Val 


Pro 


Tnr 


DO D 








Met 


Met 


Ala 


HIS 


Gin 


He 


Ala 


Gly 








600 


Asp 


Glu 


Thr 


His 






olb 




Gly 


Ser 


Leu 


Leu 




630 






Glu 


Met 


Gin 


Asn 


645 








Lys 


Gin 


Leu 


Ala 


Gly 


Ala 


Ala 


His 








680 


Thr 


Cys 


Leu 


Gly 






f o c 




Ser 


Lys 


Met 


Gin 




710 






Ser 


Ser 


Asp 


Phe 


725 








Thr 


Asp 


Ser 


He 


Arg 


Arg 


He 


Ala 








760 


Lys 


Ser 


Pro 


He 




775 




Leu 


He 


Cys 


Lys 




•inn 

790 






He 


Phe 


Lys 


Gly 


805 








Lys 


Gly 


Val 


Val 


Val 


Gin 


Tnr 


Ala 








840 


Arg 


Glu 


Gin 


Gly 






855 




Cys 


Glu 


Ala 


Arg 




870 






Leu 


Met 


Leu 


Ser 


865 








Pro 


Asn 


Leu 


Aia 


Glu 


Glu 


He 


Pro 








920 


Pro 


Ser 


He 


Gly 






935 




Pro 


Asn 


Tyr 


Val 




950 







•62- 



fne 


Leu 


Gin 


Aia 




**. J. w 






Asp 


Met 


1 y x 


Pro 


425 












Thr 










Glu 








460 


Ala 


Val 


Val 


Gly 






4 / D 




Leu 


Fne 


Lys 


Gin 




4 ?U 






Leu 


Ala 

Aia 


IT. 

ms 


vai 


DUD 








Lys 


Tie 

ne 


rile 


Pro 


LicU 


pro 


Car 
OCX 


net. 








540 


Gly 


Ala 


Thr 


Val 






DDD 




vai 


vai 


pne 


Asp 




C. "7fl 
D I \J 






Asn 


Leu 


Cys 


Tyr 


CDC 








Leu 


Ser 


GlU 


O A V 

ser 


Arg 


nv> a 

pne 


vai 


Lys 








620 


Lys 


Asp 


Trp 


Leu 






bib 




Cys 


Ser 


Asp 


Pro 




650 






Leu 


Lys 


Thr 


Thr 


665 








Gly 


Leu 


Leu 


Pro 


Arg 


Glu 


Met 


Leu 








700 


Ser 


Glu 


Gin 


Phe 






715 




Thr 


Gly 


Asp 


Leu 




Tin 

7 JU 






Phe 


Met 


Ser 


Val 


745 








Pro 


Met 


He 


Ala 


Lys 


Leu 


GlU 


Pne 






780 


Lys 


Arg 


Tyr 


He 






795 




Val 


Asp 


Leu 


Val 




810 






Lys 


Asp 


T T A 

He 


vai 


825 








Ala 


Val 


Glu 


Pne 


Val 


Pro 


Val 


Gly 








860 


Glu 


Glu 


Leu 


Phe 






875 




Ser 


Val 


Leu 


Ser 










His 


Leu 


Ser 


Val 


905 








Asn 


Val 


Gly 


Asp 


Asn 


Lys 


Gin 


Thr 








940 


He 


Glu 


His 


Lys 



955 



Gin Thr Lys Val 
415 

Val Tyr Ser Ser 
430 

Ala Lys He Cys 
445 

He Pro Lys Lys 

Lys Tyr Cys Leu 
480 

He Asn Tyr His 
495 

Thr Ala Arg Cys 
510 

Cys He Leu Thr 
525 

Val Ser Ser His 

Leu Glu Pro Lys 
560 

Phe Gin Ser Leu 
575 

Ser Thr Leu Val 
590 

Asp He Leu Thr 
605 

Pro Cys He Arg 

Ala Lys Arg Arg 
640 

Met Met- Lys Leu 
655 

Cys Asn Ser Val 
670 

Cys Val Ala He 
685 

Cys Ser Thr Val 

Phe Cys Glu Glu 
720 

Glu Val Glu Val 
735 

Arg Asn Met Val 
750 

Lys His He Thr 
765 

Glu Lys He Leu 

Gly Arg Gin Asp 
800 

Arg Lys Thr Ser 
815 

Asp Leu Leu Phe 
830 

Ser His Met Thr 
845 

He His Lys He 

Gin Asn Arg Ala 
680 

Lys Glu Met Ala 
895 

He Arg Arg Leu 
910 

Arg He Met Tyr 
925 

His Asn Tyr Glu 

He Pro He His 
960 
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Ala Glu Lys Tyr Phe Asp Gin He He Lys Ala Val Thr Asn Ala He 

965 970 975 

Ser Pro He Phe Pro Lys Thr Asp lie Lys Lys Glu Lys Leu Leu Leu 

980 985 990 

Tyr Leu Leu Pro Met Lys Val Tyr Leu Asp Glu Thr Phe Ser Ala He 

995 1000 1005 

Ala Glu Val Met 
1010 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1013 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

lv) FRAGMENT TYPE : internal 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ser Ser Val Asn Leu Met Glu Trp Ser Ala Leu Lys Thr Gin Leu 

1 5 10 15 

Gin Ala Gly Arg Asp Ala Gly Lys Ala Arg Val Ser He Gly Pro Ala 

20 25 30 

Asp Thr Ala Arg He Thr Arg Met Thr Tyr Ala Asp Asn His Leu He 

35 40 45 

Val Phe Met Asn Ala Arg Leu Ala Lys Glu Asn His Arg Leu Tyr Gin 

50 55 60 

Phe Tyr Ala Glu Val Arg Cys Asp Leu Tyr Ser Tyr Lys Ser Cys Tyr 
65 70 75 80 

Gly Thr His Ala Ser Ala Thr Cys His Arg Asn Cys Thr Ser Tyr Lys 

85 90 95. 

Thr Phe Val Met Pro Gly Leu Arg Asp Val His Thr Asp Lys Leu His 

100 105 HO 

Val Val Lys Phe Lys Arg Ser Asp Glu Lys Arg Asp Lys Asn Cys Leu 

115 120 125 

Asr> Glv Tvr Leu Ala Asp Val Asn Arg Val His Met Gin Thr Ser Leu 

P 130 135 140 

Leu Glu Gly Gin Tyr Val Arg Phe Lys Asn Ala His Ala Cys Arg Asp 

150 155 160 



TVr Arq Leu Ser His Thr Ala Lys Asp Val His Glu Phe Glu Ser Met 



165 



170 175 



Leu Glu Arg Val Gin Val Ser Ala Leu Ser His Glu He Leu Pro Val 

180 185 190 

Val Ala Cvs Tyr Asp He Glu Thr His Ser Asp Gly Gin Arg Phe Ser 

X9S 200 205 

Ala Pro Asp Ala Asp Phe He He Ser lie Ala Val Val Val Arg Arg 

210 215 220 

Asp Ala Ala Asp Thr Arg He Cys Leu Phe Tyr Ser Pro Asp Asp Pro 
225 230 235 240 

Val Asd Leu Ser Ser Ser Ser Ser Ser Pro Pro Ala Ala Pro Asp Thr 
245 250 255 



Ala Ala Val His Phe Arg Ala Glu Arg Asp Met He Ala Ala Phe Phe 

Gin Leu Leu Pro" Leu Leu Asn Ala Asp Val Val Leu Asp Phe Asa Gly 

2 7 5 280 285 

Asp Lvs Phe Asp Leu Pro Phe Leu Thr Gly Arg Ala Asn Lys Leu Cys 

290 295 300 

Gly Pro Ala Glu Ala Ala Arg Ala Thr Lys lie Ala Arg Tyr Asp Leu 
305 31° 315 

Ser Pro Val Asn Val Val Thr Gin Gin Ser Tyr Asp Lys Phe Ser Asn 

325 330 335 

Lvs Leu His Ser His Tyr Leu Thr Tyr Tyr He His He Asp Leu Tyr 

* 340 345 350 

Gin Phe Leu Ser Thr Asp Ser Glu His Asn Asp Leu Glu Asn Phe Gin 
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355 360 365 

Leu Asn Thr Val Ala Glu His Tyr Leu Lys Lys Ser Lys Val Asp Leu 

370 - 375 380 

Pro He His Asp Met Leu Gin Met Tyr Gly Glu Lys Arg Leu Ser Arg 
385 390 395 400 

He Val Glu Tyr Asn Val Gin Asp Cys Val Leu Pro Val Glu Leu Phe 

405 410 415 

Leu Lys Leu Glu He Ala Asp Tyr Met Tyr Thr Gin Cys Met Leu Leu 

420 425 430 

Tyr Leu Cys Thr Asd Asp Leu Leu Arg Asn He Ser His Lys He Thr 

435 " 440 445 

Val Ala Tyr Phe His Leu Ala Leu Thr Asn Thr Val Ala Arg Arg Pro 

450 455 460 

Asp Pro Thr Pro Asp Pro Tyr Phe Phe Asn Lys Tyr Asp Leu Ser Val 
465 470 475 480 

— Thr Ser Gly Ala Ser Ala Pro Ser Thr Ser Arg Pro Ala Asn Ala He 

485 490 495 

Asp Leu Ser Gin Leu Lys Arg Thr Pro Val Asp Ala Ala Arg He Pro 

500 505 510 

Pro Ser Ala Val Lys Leu Cys Ser Thr Arg Gin Ser Cys Thr Tyr Lys 

515 520 525 

Gly Gly Lys Val Leu Ser Pro Lys Pro Gly Phe Asn Arg Trp Val Ala 

530 535 540 

Thr Leu Asp Phe Asn Ala Leu Tyr Pro Thr He Met Met Trp Glu Gly 
545 550 555 560 

Val Cys Met Ser Ser Asn Val Phe He Ala Ser Asp Gly Asn Val Tyr 

565 570 575 

Leu Asp Lys Asn Val Asn Ala Val Asn Pro Lys Leu Leu Lys Thr Leu 

580 5B5 590 

Ser Glu Met Arg Val Arg Tyr Lys Gly Leu Arg Asp Gin Cys Glu Tyr 

595 600 605 

Asn Ser Phe Tyr Tyr Lys Leu Tyr Asp Lys lie Gin Asn Ala Leu Lys 

610 615 620 

Arg He Ala Asn Ser lie Tyr Gly Tyr Tyr Gly He Phe Phe Lys Pro 
625 630 635 640 

Leu Ala Asn Tyr lie Thr Lys Met Gly Arg Gly Lys Leu Lys Glu Val 

645 650 655 

Val Gly Lys Val Glu Ala Met Ser Asp Asp Pro Arg He Leu Arg Glu 

660 665 670 

Phe Gly Leu Ser Lys He Asn Phe Ser Val lie Tyr Gly Asp Thr Asp 

675 680 685 

Ser Cys Phe He Arg Val Leu Phe Asp Glu Ala Glu Trp Arg Arg Thr 

690 695 " 700 

Ala Ala Arg Pro Arg Ser Ala Pro Ser Cys Arg Thr Thr Cys Ala Lys 
705 710 715 720 

Arg Ser Thr Thr Leu Trp Cys Gly Tyr Lys Met Ser Leu Glu Asn He 

725 730 735 

Met Leu Ser Leu He Leu Leu Lys Lys Lys Lys Tyr Cys Tyr Leu Asn 

740 745 750 

Asn Glu Gin Arg Thr Lys Tyr Lys Gly Trp Leu He Lys Arg Asp Met 

755 760 765 

Pro Leu Phe Met Arg Lys Ala Phe Arg Ala Thr Val Asp Ser Phe Ser 

770 775 780 

Ala Ala Thr Arg Arg Val Arg Ala Arg Pro Ala Arg Arg Glu Met Leu 
785 790 795 800 

Arg Tyr Tyr Arg Glu Phe Gly Ala Pro Arg Glu Asn Leu Val Asp Tyr 

805 810 815 

Cys Phe Ser Met Ser Tyr Asn Glu Thr Ser Thr Thr Ala Lys Arg Arg 

820 825 830 

Lys Glu Glu Asp Pro Ala Arg Lys Pro Val lie Thr lie Ala T.ys His 

835 840 845 

Cys Arg Glu Leu Leu Ala Asn Pro Gly Val Asp Phe Leu Pro Gly Asn 

850 855 860 

Gly Asp Arg lie Gin Tyr Val Leu Val Asp Val Lys Glu Lys lie Thr 
865 870 875 880 

Gin Lys Ala Phe Pro Leu Lys Leu Phe Asp Pro Asp Ser Pro Thr Leu 

885 890 895 

Gin He Ser Trp Leu Lys His Met Asn lie Leu Cys Thr Phe Met Asn 

900 905 910 

Glu Leu He Gin Val Phe Gly Asn Arg Pro Glu Phe Glu His Tyr Phe 



SUBSTITUTE SHEET (RULE 26) 



WO 97/01578 



PCT/US96/10958 



-65- 

915 920 925 

Gly Ala lie Val Asp Glu Tyr Thr Ser Ala Gin Met Tyr Asp Val Arg 

930 935 940 

Tyr Pro Val Leu Val Pro Thr Arg Arg Ala Lys Ala Glv Lvs Ser Ala 
945 950 955 * ' 960 

Lys Lys Asn Asp Ser Asp Ser Asp Ser Asp Ser Asp Asp Asp Asp Asp 

965 970 975 

Pro Ala Thr Thr Pro Val Asn Tyr His Ser Leu Phe Ser Met His Leu 

980 985 990 

Lys Lys Pro Lys Arg Gin Ala Val Gly Glu Phe Glu Pro Cvs Pro Gin 

995 1000 1005 

Cys Val Ala Arg Ala 
1010 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



Met Asp 


Arcr 


Asn 


Ala 


Val 


Leu 




Glv 


Val 


Leu 


Glu Hi^ Ar*o Li^n Prn 
uiu axs /iiy ucu 


1 








5 










10 




1C 
Aw 


Lys 


Trp 


Val 


Glu 


Leu 


Ser 


Asp 


ASp 


Thr 


ASp 


Leu 


Glu Pro Phe Phe Phe 








20 










25 






30 


Ser 


Ser 


Val 


Arg 


Tyr 


He 


Thr 


Ala 


Gly 


Ser 


Glu 


Asp Ala He Met He 






35 










40 








45 


Gin Ala 


Leu 


Asn 


Leu 


Asn 


Thr 


Asp 


Glu 


He 


Val 


Val Phe Leu Val Thr 




50 










55 










60 


Asn 


Leu 


Asn 


Phe 


Met 


Ala 


Leu 


He 


Pro 


Thr 


Val 


Tyr He Glu Asn Pro 


65 










70 










75 


80 


Gly He 


Arg 


Gin 


Leu 


He 


Ala 


Ser 


Thr 


Pro 


He 


Ser Tyr Arg Ser Pro 










85 










90 




95 


He 


Thr 


val 


Phe 


Asn 


Gly 


Asp 


Leu 


Lys 


Lys 


Trp 


Met Asp Cys Asp Leu 








100 










105 






110 


Phe 


Val 


Phe 


Gly 


Thr 


Met 


Ala 


Ala 


Gin 


Lys 


Ala 


Phe He Lvs Ala Gly 






115 










120 








125 


Asn 


Ser 


Val 


Leu 


Gly 


Ser 


Leu 


Gly 


Gly 


Asn 


Val 


Tyr Thr Tyr Gly Asp 




130 










135 










140 


His 


Val 


Ser 


Asn 


Phe 


Asp 


Gly 


Asn 


Thr 


Pro 


Val 


Leu Gin Asn Asn Leu 


145 










150 










155 


160 


Met 


Cys 


Ser 


His 


Val 


Tyr 


Tyr 


Thr 


Arg 


Tyr 


Lys 


Thr Asp Val Tyr Ala 










165 










170 




175 


Pro 


Trp 


Glu 


Phe 


Tyr 


Tyr 


Asp 


Gin 


Lys 


Arg 


Asp 


Gin Gly Tyr Leu Met 








180 










185 






190 


Ser 


Leu 


Pro 


Ala 


He 


lie 


Pro 


Arg 


Cys 


Lys 


Arg 


Glu Gly Ala Phe Asp 






195 










200 








205 


He 


Glu 


Thr 


He 


Val 


His 


Glu 


Asn 


Ala 


Met 


Asp 


Gin Asp Leu Asn Cys 




210 










215 










220 


Gin 


Lys 


Phe 


Phe 


Lys 


Ser 


Glu 


Phe 


Arg 


Ser 


Met 


Glu Glu Ser Gin Val 


225 
















235 


240 


Leu 


He 


Gin 


Arg 


Phe 


Arg 


Glu 


Ala 


Gly 


Val 


Thr 


Gly Leu Pro Pro Ser 








245 










250 




255 


Pro 


Phe 


Val 


Gly 


He 


Thr 


Gin 


Lys 


Leu 


His 


Glu 


He Val Ser He Ser 








260 










265 






270 


Leu 


Val 


Val 


Cys 


Asn 


Tyr 


His 


Lys 


Thr 


Gly 


Pro 


Lys Lys Lys Glu Tyr 






275 










280 








285 


Tyr Val 


Tyr 


Tyr 


Asn 


Thr 


Lys 


Lys 


Met 


Glu 


Asn 


Pro Met Glu Met He 




290 










295 










300 


Pro 


Val 


Glu 


His 


Leu 


His 


Leu 


Asp 


Ala 


Ser 


Arg 


He Lys Phe Glu Ala 


305 










310 










315 


320 
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Cvs 

v_ye» 


Lys 


Asn 


Glu 


Phe 


Tvr 


Met 


Leu 


Leu 


Ala Phe lie Asn Arg Leu Arg 








325 










330 335 


LVS 


Ser 


Val 


Asn 


Val 


Leu 


Tyr 


Val 


Tyr 


Asn Ala Gin Phe Asp lie Gin 






340 










345 


350 


Val 


lie 


Gin 


Gin 


Arc 


Leu 


Arc 


Tvr 
y 


Tvr 
y 


Ala Phe Lys Gin Arg Ala Pro 






355 










360 




365 


Arc* 


Cvs 


Cvs 

v*y a 


Lvs 


Glv 


His 


Asd 


Asp 


lie 


Pro His Glu Trp Gly Lys Ala 




370 










3 75 






380 


LeU 


Met 


Glu 


Lys 


Trr> 


Glu 


Ala 


Phe 


Leu 


Car* Val Lvs Pro Gin Leu Phe 


385 










390 








395 400 


Lys 


Ala 


Gin 


lie 


Leu 


Met 


Glv 


Gin 


Asp 


He Leu Lvs Ala Asn Tvr Leu 








405 










410 415 


Lys 


Leu 


Leu 


Glu 


Glv 


lie 


Glv 


Ser 


Val 


Leu Ala. Gin Ala Lvs Ser Thr 






420 










425 


430 


Met 


Ala 


Lys 


Met 


Cvs 


Thr 


lie 


Lys 


Glu 


Aro He Asrj Ser Tvr Ara Lvs 






435 










440 




445 


Met 


Lys 


Asp 


Thr 


Val 


Gin 


Asn 


Phe 


Lys 


Ser Hi ^ Glv Phe Glv Cvs Asn 




450 










455 






460 


lie 


lie 


Asp 


Met 


Met 


Tvr 
xy x 


Val 


Cys 


Lys 


Ara Lvs Glu Phe Glu Ala. Lvs 


465 










470 








475 460 


A cn 


Glv 
uxy 


Ser 


Leu 


Asn 


Thr 


Val 


Ala 


Gin 


Leu Tie Tie Lvs Lvs Phe Lvs 

uwu xxw xx^ xj y w Arf y ^ at iaw uy m 










485 










490 495 


Pro 




Xjys 




Thr 

X llX 


Prn 

XT X W 


T .w c 
ijy a 


Tip 
XXC 


nx o 


t.vq Mar* 2Vct% A cn Tla HThr* Tvrr" 
xjy a i*ic v~ iioy /iop x xc x nx x y x 








500 










505 


510 


ASp 


Lys 


Leu 




uxy 


Tyr 


Tyr 


Arg 


A 1 A 
Hid 


ri] xr C2*\ \r Thr- T vc Tla Hi a Cl n 
uxy uiy XXXX xiyo XXc AXa uXU 






Jlw 










J X \J 




Z3 4 J 


Cys 


Leu 


Tip 

XXc 


xyr 


MS IX 


Leu 


Tla 
lie 


A pa 


Coy 
OCX 


T .01 1 T oi i \Ts> 1 Tla 2i ttt Tla ZV 1 a 
XxcU XjcU Vax XXc >ixy XXc AXa 




J 










j j 3 








Lys 


Asn 


Leu 


Lys 


Pro 


Mat- 
net. 




fTil 11 
ulU 


Tyr 


Tla Tirv & f2. In T ai i 2i 1 a Pi/e 
XXc Xyr >i-X7y vsXlX XjcU nXa vys 












«sS0 

7 J u 








333 3 D w 


Tyr 


Asn 


Tl A 


Asp 


Thr 

XXXX 


i-iX a. 


Ll a 
nla 


flXS 


Thr 

XXXX 


A «>*/»t o 1 > » it* 1 Mar* A cti Dha f*\Tc 

/\xvy uiy voix ixicu /ioxx xrxxc uys 




















3 / \J 3 f 3 


/-» i - . 

laiy 


fixe 


Tla 
IXc 


fSl 


Ser 


X XXX 


Lys 


Val 
V cl X 


Va 1 
V cl X 


UlU V clX Ocx i~ix y nail uya nXa 
















585 


590 


AX3 


Leu 


Asp 


Al a 


uxy 


Tl e 

XXC 


Val 
vol 


Mot 

ric w 


Ala 


Thr A en Tvr Tla IVrfi Zk.cn Car 
xxxx ao^j x y x xxc <rvxy i^oix ocx 




















COR 

O w 3 




xrxie 


Thr 
X XIX 


Pro 


UXU 


Thr 
X XIX 


Tl 
XXC 


Pro 


Arg 


a rrr ni v Rl v Pha Val Mar il a 
/ii y uxy uxy xrxxc vax ncu nxa 




610 










615 






620 


Pro 


T.an 
Xjcu 


X XXX 


vj-L y 


T.an 


pho 


Phe 


ax a 


Aro 


Prn Thr Gin Cvs Phe Glu Leu 

clU X 1XX w 111 L>jrO Jr IXC VJX ix mC u 


O a 3 










630 








635 640 


Cys 


Leu 


iiap 


xrXXc 


Thr 

XXXX 




Mar 


ryr 


Pro 

Xr X U 


Car Ma r Mar Pvc Asn Leu Asn 
ocx i ic l. i T icw uyo ucu aau 






645 










650 655 


Tip 


Car 




uiu 


Thr 

X llX 


Tip 
XXC 


Val 




Car 
OCX 


A^rj T.vc Thr Asn Am Val Glv 

rio^ uy« 1 11X nsu f%~L. y vox uxy 








660 










665 


670 


A 

ASp 


xyr 


Met 


oxy 


Tyr 


ASp 


Trp 


Cat* 
OCX 


Lys 


Tla fi cr> ^In n 1 \/ Dha f2l 11 T .vrc 
xxc nop uxxx uxy xriic uiu x»y o 






O t 3 










680 




685 


Dha 

XT IlC 


Thr* 
1 ill 


Leu 


wax 


T.an 


&rn 


vox 


Asp 


-Hxy 


Thr 2\^n Pro ftl n A<;n Pro T.vs 

1 11X AsU XT X w w X 14 nSll * X w Ujr S 














695 






700 


Leu 


v=l 

Vai 


Arg 


Hi e 

tixs 


Thr* 
X XXX 


Cot* 
OCX 


Asp 


Thr 
XXX X 


Car 
OCX 


T.an Cor T.an T.vre Arn Tvrr 7 .pi i 
Xjcu ocx xjcix xjyo /u>^ ± y i. ucu 


7 AC 










710 








715 720 


Arg 


Leu 


ni y 


X XXX. 


UX U 


Hi s 


T .\f<z 

*->y *s 


ft rn 


Ala 


Leu Lvs Gin Ser Ser Glv Ser 

Uj« will *7 w X wwX vl Jr wCX 








725 










730 735 


vai 


lis 
Ax a 


uXU 


Tyr 


Hi e- 
XTXo 


Hen 




T .01 1 


Gin 


Asn £21 u Met Lvs lie Cvs Thr 
ridi* ux u nc w xj ya xxc v. y o x nx 








7 An 










74 ^ 


750 


Asn 


Tnr 


HIS 


Tyr 


uxy 


Va 1 
VdX 


ser 


r:l ii 

uXU 


nxa 


Thr Pvq Ccr T.eu Met* Tie Thr 
x iix cys ocx ucu ucu xxc iiix 














7C0 




765 


Thr 


Gin 


Gly 


Gin 


rixS 


Lys 


Tl a 

lie 


Lys 


Leu 


Val lien 1 1 i Pha Tla T.vq Thr 
VdX /l^XX Ul U XrXlC XXC Uya 1 XXX 




»7 r\ 

77 0 








/ / 3 






7B0 


Leu 


Asn 


Arg 


Thr 


Gly 


XllS 


Ser 


T a 

_eu 


rne 


r> y> Tv^« T^rr 1 ^ f A p-\ Thr* A ^ t"i 

pro Asn Ayr biy itsp mr >isp 


785 
















795 900 


Ser 


Thr 


Met 


Leu 


Tyr 


tJ -i 

WIS 


Pro 


Ca 

oer 


Asp 


Car ^Tlln Th r CZ 1 n T .a 1 1 fZ 1 1 1 
wXU OCX UXU X XXX UXIX XjCIX UXU 










605 










810 815 


Asp 


Met 


Val 


Thr 


Leu 


Glu 


Asp 


Glu 


Met 


Arg Ala Glu Leu Arg Glu Tyr 






820 










825 


830 


Met 


Leu 


Lys 


Lys 


Leu 


Ser 


Ala 


Glu 


Leu 


Val Asn Arg Val Lys Glu Lys 






835 








840 




845 


Thr 


Lys 


Arg 


Thr 


Asp 


Thr 


Phe 


Val 


Gin 


Ser Phe Leu Ser Asp Val Glu 




850 










855 






860 


Thr 


Val 


Leu 


Phe 


Asp 


Asp 


Met 


Val 


Glu 


Lys Leu Arg Leu Phe Ser Gin 



865 870 875 880 
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Gly Glu 


V CLX 


x xe 


\J X LI 


ri u 


IT 


Lys 


Asp 


VjxV VaXy 


inr 


Trp Trp Val Val 










885 










890 




895 


Asp 


Pro 


Leu 


Thr 
900 


Glv 

■jxy 


He 




Met 


Asd 
905 


r*\/c Coy* 
v-yo oex 


Thr 


Pro Phe Ser Ser 
910 




Leu 


He 
915 


Cvs 


Lys 


Leu 


Glu 


Tvr 
920 


Glu 


7\ rri Ala 

noli nid 


Ser 


Ser He Gly Cys 
925 


His 


Val 


Ala 


Lys 


Lys 


Met 


Val 


OCX. 


xx c 




Thr 


Tyr Leu Phe Phe 




930 










935 








940 


Lys 


Lys 


He 


Ser 


Leu 


Tyr 


His 


Val 


Arg 


Val Trr> 


Arg Met Cys Ala Asp 


945 










950 








955 




960 


Thr 


Asp 


Gly 


Ser 


Pro 
965 


Ser 


His 


Leu 


Tyr 


Phe Pro 
970 


Val 


Ser Leu Ser Arg 
975 


Thr 


Arg 


Ala 


Lys 


Gin Arg 


Gly 


Asp 


His 














980 










985 









(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Met 


Lys Leu 


Lys 


Lys Leu 


Tyr 


He 


Phe 


Tyr 


Phe Asp He Tyr Glu Tyr 


1 






5 








10 


15 


Phe 


Leu Cys 


Asp 


Leu Gin 


Leu 


Ser 


Glu 


Thr 


Asn Glu He Leu Lys Tyr 






20 








25 




30 


He 


Lys Asn Asn 


He Asp Lys 


Tyr 


Thr 


Asn 


Ser Phe Asn Ser Ser Tyr 




35 








40 






45 


He 


He Leu 


Lys 


Asp Phe 


Asn 


He 


He 


Thr 


Asn Glu Val Glu Leu Gin 




50 






55 








60 


Ser 


Tyr Tyr Asn 


Phe Thr Glu Asp Ser 


Lys 


lie Lys Leu Asn Asn Thr 


65 






70 










75 80 


Asp 


Leu He 


Leu 


Phe Met 


Thr 


Pro 


Tyr 


Lys 


lie Glu Arg lie Tyr Ser 








85 








90 


95 


Lys 


Tyr Asn Arg 


Asn Phe 


Asn 


Gin 


Tyr 


Arg Trp Phe Tyr He Leu Asn 






100 








105 




110 


Asn 


He Glu 


Pro 


Ala Gly 


Ser Tyr 


Lys 


He 


Asn Met Ser Asn Leu Gin 




115 








120 






125 


Asn 


He Asn 


He 


Tyr Asp 


Lys 


Asn 


Lys 


Thr 


Ala Tyr Tyr Cys Lys Asn 




130 






135 








140 


Pro 


Lys Leu Leu 


Phe Leu 


Thr 


Pro 


He 


Glu He Asp Lys Phe lie Pro 


145 






150 










155 160 


Val 


Ser Arg Val 


Ser He 


Asp 


He 


Glu 


Cys 


Gin His Phe Gly Glu Phe 








16S 








170 


175 


Pro 


Thr Pro 


Asn 


Lys Phe 


Pro 


He 


Ser 


His 


lie Cys lie Asp Trp Phe 






180 








185 




190 


Met 


Glu Ser 


Asn 


He Asn 


Pro 


Val 


Lys 


Lys 


lie lie Thr Leu He Asn 




195 








200 






205 


Tyr 


Glu He 


He 


Lys Asn 


Tyr 


Lys 


Gly 


Glu 


Gin Lys Asp Arg Phe lie 




210 






215 








220 


Tyr 


Thr Glu 


He 


Asp Glu 


Leu 


Leu 


Thr 


Lys 


Asp Lys Val Tyr He Thr 


225 






230 










235 240 


He 


Tyr Cys 


Thr 


Glu Lys 


Tyr 


Met 


Leu 


His 


Phe lie Leu Tyr Thr Leu 








245 








250 


255 


Arg 


Lys Asp 


Phe 


Asp Tyr 


He 


Leu 


Thr 


Tyr Asn Gly His Ser Phe Asp 






260 








265 




270 


Phe 


Thr Tyr 


He 


Gin Gly Arg Arg 


Lys 


Phe 


Tyr Asn Leu Asn Glu Leu 




275 








280 






285 


Cys 


Leu Val 


Asn 


Ala His 


Lvs 


Ser 


Asn 


Glu 


Leu Lys lie Tyr Ser Tyr 




290 






295 








300 


Asn 


Lys Asp Thr 


Thr Tyr Glu 


He Asp 


Ser Asn Asn Gly He He Phe 
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305 










310 






Leu 


Asp 


Leu 


Tyr 


Asn 


Tyr 


T 1 a 

lie 


Lys 








325 








Lys 


Leu 


Gly 


Glu 


He 


Ala 


T we 










340 










lie 


Asp 


Asn 


Gly Asp Glu 


Tyr 


Tl o 
116 






*3 C C 










JDU 


Asn 


Lys 


Asn 


Lys 


vai 


Ser 


Ti- 
ne 


Phe 




*5 *1 ft 

3 70 










J / D 




Tyr 


Cys 


Phe 


He 


Asn 


Asn 


Tic 

lie 


Pro 


•5 a e 
3 B5 








1 Q A 






lie 


He 


Asn 


Asp 


Lys 


GlU 


Lys 


Leu 










405 








Ser 


Leu 


Tyr 


Gin 


Gin 


Phe 


Lys 


ne 








420 












Asp 


Glu 


Thr 


Thr 


Lys 


Val 


wen 


Leu 






A 1 C 










A A ft 


Asn 


Lys 


Asn 


Ala 


Tyr 


Val 


Asn 


rue 




4 50 














Ala 


Tyr 


Tyr 


Cys 


Thr 


His 


Asp 


Thr 


465 










470 






Tyr 


Asp 


Met 


He 


His 


Asp 


Lys 


Val 










485 








Leu 


Pro 


Gin 


Tyr 


Met 


Ser 


Phe 


Lys 








500 










Gly 


Leu 


Leu 


Leu 


Lys 


Thr 


Leu 


Phe 




515 










520 


Gly 


Asn 


Leu 


Glu 


Phe 


Glu 


Lys 


Phe 




530 










535 




Lys 


Gin 


Lys 


Tyr 


He 


Asp 


Ser 


He 


545 










550 






Glu 


Tyr 


Pro 


Ser 


Asn 


He 


He 


Glu 








565 








Glu 


Lys 


Val 


He 


Lys 


Leu 


Gin 


Asp 








580 










Glu 


Asn 


Tyr 


Leu 


Lys 


Glu 


Lys 


Tyr 






595 










600 


Leu 


He 


Lys 


Lys 


Asp 


Lys 


Thr 


Tyr 




610 










615 




Lys 


Pro 


Gly 


He 


He 


Thr 


Gin 


Met 


625 










630 






Asn 


Glu 


Tyr 


Lys 


Asn 


Leu 


Lys 


Asn 










645 








Tyr 


Asn 


Tyr 


Tyr 


Thr 


Ser 


Ala 


Leu 








660 










Ser 


Leu 


Tyr 


Gly Leu Leu 


Gly 


Ser 






o /b 










£ q n 


Tyr 


Cys 


Ala 


Glu 


Tyr 


Cys 


Thr 


Ala 




690 










c a c 
o 95 




lie 


Lys 


Asn 


Leu 


Val 


Asp 


Lys 


Ser 


705 








710 






Leu 


Asn 


Glu 


Gin 


Asn 


Asn 


Pro 


Phe 










725 








Tyr 


Ser 


Gly 


Asn 


Leu 


Asp 


Val 


Asn 








740 










Asp 


Thr 


Asp 


Ser 


Leu 


Phe 


He 


Asn 




755 










*7 e r\ 
7b 0 


Asn 


Lys 


Glu 


Asp 


Leu 


Val 


Asn 


Lys 




770 










775 




Ser 




He 


He Asn 


A.^n 


r?i u 




785 










790 






Asn 


Phe 


Glu 


Tyr 


Glu 


Lys 


Met 


Tyr 










805 








Lys 


Lys 


Tyr 


He 


Gly Glu 


Val 


Val 








820 










He 


Ser 


Asp 


Ser 


Lys 


Gly 


Thr 


Ala 






835 










840 


He 


His 


Lys 


Thr 


He 


Leu 


Lys 


Asn 




850 








855 




Leu 


Thr 


Asn 


Asn 


Cys 


Thr 


He 


Gin 



•68- 







315 










320 


Lys 


Tie 




Asn 


Tyr 


Asn 


Ser 


Tvx 




330 










335 






Phe 

Zr ll\Z 


Asn 


Ile 


Leu 


Ser 


Lys 


Ile 


345 










350 






lie 


Met" 


It i LJ 


Leu Asp 


Thr 


Ala 


Asp 










365 








iyr 




v a X 


He 


Arg 


Thr 


Ala 


Asn 








380 










Tyr 


Lys 


Tie 


Lys Asp Lys 


Thr 


Lys 






395 










400 


Tyr 




Pro 


He 


Ser 


Ile 


Glu 


Asn 


410 

*x X w 










415 




iyr 


T Arc 




Asn 


Thr 


Pro 


Ile 


Ser 


« 3 










430 






Ser 


_ 

Lys 


Asp 


Asp 


Val 


Asp 


Ile 


Glv 










445 








Thr 


_ 

l^ys 


ASp 


Lys 


Ser Asp 


Asp 


Ile 








460 










vai 


Leu 


Cys 


Asn 


Cys 


He 


riic 


T.vf c 
uy o 






475 










480 


lie 


Aia 


Phe 


Ser 


Asn 


Glu 


iyr 


Leu 




4 90 














Tyr 


Lys 


Ser 


Thr 


Thr 


Asn 


lie 


Ser 


505 










510 






Cys 


Asn 


Arg 


Ser 


Met 


Ile 


vai 


ser 






525 








Glu 


Gly 


Gly Tyr 


Val 


Leu 


VilU 


Pro 








540 










Thr 


Ala 


Val 


Phe 


Asp 


Phe 


Asn 


f A W 

ser 






555 










3D U 


Ala 


Asn 


Leu 


Ser 


Pro 


Glu 


Lys 


vai 




570 














Asp 


Glu 


Tyr Ala 


Val 


Asp 


lie 


vai 


585 










590 






Pro 


Tyr 


Pro Asp 


Tyr Cys 


Tyr 


nee 










605 








Lys 


Pne 


Ile 


Val 


Met 


Asp 


TV 

Arg 


Arg 






620 










lie 


ASp 


Lys Gly Met Lys 


Cor 


Lys 






635 










640 


Ti- 
ne 


Asn 


Lys 


Asn 


Asn 


Pro 


Val 
v ax 


iiCU 




D3U. 








655 




Tyr 


O i-» 

ber 


Lys 


Lys 


Ile 


Thr 


Tie 
lie 


iU>u 


bob 










670 






ulU 


Arg 


Phe 


Asp 


Phe 


Asn 












685 








Leu 


Gly 


Gin 


Lys 


Cys 


Ile 


Lys 


Tyr 








700 










Arg 


Tyr 


Ile 


Asp 


Asn 


Asn 


Leu 


Tyr 






715 












Ser 


Asn 


Glu 


Pro 


val 


He 


l nr 


Arg 




730 










/J3 




r> L_ A 

Phe 


Thr 


Phe 


Tyr 


Ile 


Ile 


Tyr 


vjiy 


745 










750 






He 


Lys 


Phe 


Asp 


Asn 


Lys 


pne 


Asp 








765 








Ser 


His 


Glu 


Cvs 


Phe 


Gin 


rne 


Leu 








780 










Asn 


He 


Ile 


Leu 


Ser 


Lys 


AFH 


pne 






795 










nnn 


He 


Trp 


Met 


Leu 


Leu 


Leu 


Aia 


i*ys 




810 










815 




Ser 


Ser 


Met 


Asn 


Pro 


Leu 


Gin 


Leu 


825 










830 






Leu 


He 


Arg 


Arg Asp 


Cys 


Thr 


Glu 










845 








Thr 


Ile 


Asp 


Ile 


Leu 


Lys 


Glu 


Tyr 








860 










Asp 


Val 


Asn Asn Lys Ile 


Asn 


Asn 
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865 










870 










875 




880 


Tyr 


Leu 


Met 


Phe 


Thr 


Phe 


Lys 


Asn 


He 


He 


Glu 


Asn He Gin Asn 


Leu 








885 










890 




895 




Asp 


lie 


Asn 


Glu 


Phe 


Lys 


Lys 


Ser 


Val 


Lys 


Tyr Thr Gly He Tyr Lys 








900 










905 






910 




Asp 


Pro 


Asn 


Phe 


Tyr 


lie 


Glu 


Leu 


Cys 


Val 


Lys 


Lys Tyr Asn Ser Lys 






915 










920 








925 




Asn 


Pro 


Asn Asp 


Lys 


He 


Val 


Lys Gly 


Gin 


Arg 


Phe Asn Phe lie 


Tvr 




930 










935 










940 




Ala 


His 


Glu 


He 


Asp 


He 


Trp 


Asp 


He 


Glu 


Thr 


Lys Lys Trp Asn 


Thr 


945 










950 










955 




960 


Lys 


Tyr 


Thr 


Ser 
























(2] 


► INFORMATION FOR SEQ ID 


NO:19: 








(i) SEQUENCE CHARACTERISTICS: 














(A) 


LENGTH: 


763 


amino acids 














(B) 


TYPE : amino 


acid 
















(C) 


STRANDEDNESS : single 














(D) 


TOPOLOGY: linear 














(ii) MOLECULE 


TYPE: peptide 












(iii) 


HYPOTHETICAL: NO 














(iv) ANTISENSE: NO 
















(v) FRAGMENT TYPE : 


: internal 












<vi) ORIGINAL 


SOURCE : 
















(xi) SEQUENCE 


DESCRIPTION; 


; SEQ ID 


NO: 19: 




Met 


Pro 


Leu 


Ser 


Tyr 


Gin 


His 


Phe 


Arg 


Lys 


Leu 


Leu Leu Leu Asp 


Asp 


1 








5 










10 




15 




Glu 


Thr 


Glu 


Ala 


Gly 


Pro 


Leu 


Glu 


Glu 


Glu 


Leu 


Pro Arg Leu Ala 


Asp 








20 










25 






30 




Ala 


Asp 


Leu 


Asn 


Arg 


Arg 


Val 


Ala 


Glu 


Asp 


Leu 


Asn Leu Gly Asn 


Leu 






35 










40 








45 




Asn 


Val 


Ser 


He 


Pro 


Trp 


Thr 


His 


Lys 


Val 


Gly 


Asn Phe Thr Gly 


Leu 




50 










55 










60 




Tyr 


Ser 


Ser 


Thr 


Val 


Pro 


He 


Phe 


Asn 


Pro 


Glu 


Trp Gin Thr Pro 


Ser 


65 










70 










75 




80 


Phe 


Pro 


Lys 


He 


His 


Leu 


His 


Glu 


Asp 


He 


Ala 


Asn Arg Cys Gin 


Gin 








85 










90 




95 




Phe 


Val 


Gly 


Pro 


Leu 


Thr 


Val 


Asn 


Glu 


Lys 


Arg 


Arg Leu Lys Leu 


He 








100 










105 






110 




Met 


Pro 


Ala 


Arg 


Phe 


Tyr 


Pro 


Asn 


Ser 


Thr 


Lys 


Tyr Leu Pro Leu 


Asp 






115 










120 








125 




Lys 


Glv 


lie 


Lys 


Thr 


Tyr 


Tyr 


Pro 


Asp 


His 


Val 


Val Asn His Tyr 


Phe 


130 










135 










140 




Gin 


Thr 


Arg 


His 


Tyr 


Leu 


His 


Thr 


Leu 


Trp 


Lys 


Ala Gly He Leu 


Tyr 


145 










150 










155 




160 


Lys 


Arg 


Glu 


Thr 


Thr 


Arg 


Ser 


Ala 


Ser 


Phe 


Cys 


Gly Ser Pro Tyr 


Ser 






165 










170 




175 




Trp 


Glu 


Gin 


Glu 


Leu 


His 


His 


Gly 


Arg 


Leu 


Val 


He Lys Thr Ser 


Gin 






180 










185 






190 




Arg 


His 


Gly 


Asp 


Glu 


Pro 


Phe 


Cys 


Ser 


Gin 


Pro 


Ser Gly He Leu 


Ser 




195 








200 








205 




Arg 


Ser 


Ser 


Val 


Gly 


Pro 


Cvs 


lie 


Arg 


Ser 


Gin 


Phe Lvs Gin Ser 


Arg 


210 








215 










220 




Leu 


Gly 


Leu 


Gin 


Pro 


His 


Gin 


Gly 


Pro 


Leu 


Ala 


Thr Ser Gin Pro 


Gly 


225 








230 










235 




240 


Arg 


Ser 


Gly 


Ser 


He 


Trp 


Ala 


Arg 


Val 


His 


Ser 


Pro Thr Arg Arg 


Cys 






245 










250 




255 




Phe 


Gly 


Val 


Glu 


Pro 


Ser 


Gly 


Ser 


Gly 


His 


11 


Gly His Arg Ala 


Ser 






260 










265 






270 




Asp 


Ala 


Ser 


Ser 


Cys 


Leu 


His 


Gin 


Ser 


Ala 


Val 


Arg Lys Ala Ala 


Tyr 




275 








280 








285 




Ser 


His 


Leu 


Ser 


Thr 


Ser 


Lvs 


Arg 


Gin 


Ser 


Ser 


Ser Gly His Ala 


Val 




290 










295 










300 




Glu 


Phe 


His 


Ser 


Phe 


Pro 


Pro 


Ser 


Ser 


Ala 


Arg 


Ser Gin Ser Gin 


Gly 


305 










310 










315 




320 
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Pro 


Val 


Phe Ser 




Trn 


Tiro 


Leu Gin Phe Arg 


Asn Thr Gin Pro Cys 








325 






330 


335 


Ser 


Asn 


Tvr Cvs 


Leu 


Ser 


His 


Leu Val Asn Leu Leu Glu Asp Trp Gly 






340 








345 


350 


Pro Cys 


Thr Glu 


His 


Gly 


Glu 


His His He Arg 


He Pro Arg Thr Pro 






355 








360 


365 


Ala Arg 


Val Thr 


Glv 


Glv 


Val 


Phe Leu Val Asp Lys Asn Pro His Asn 




370 








375 




380 


Thr 


Ala 


Glu Ser 


ft y-rr 


Leu 


Val 


Val Asp Phe Ser 


Gin Phe Ser Arg Gly 










390 




395 


400 


Ser 


Thr 


Aro Val 


Ser 


Trn 


Pro 


Lys Phe Ala Val 


Pro Asn Leu Gin Ser 








405 






410 


415 


Leu 


Thr 


Asti Leu 

nan w u 


Leu 


Ser 


Ser 


Asn Leu Ser Trp 


Leu Ser Leu Asp Val 






420 








425 


430 


Ser 


Ala 


Ala Phe 


Tvr 


His 


He 


Pro Leu His Pro 


Ala Ala Met Pro His 






435 








440 


445 


Leu 


Leu 


lie Gly 


Ser 


Ser 


Glv 


Leu Ser Arg Tyr 


Val Ala Arg Leu Ser 




450 








455 




460 


Ser 


Asn 


Ser Arg 


lie 


Asn 


Asn 


Asn Gin His Gly Thr Leu Gin Asn Leu 


465 








470 




475 


480 


His 


Asp 


Ser Cys 


ser 


Arg 




Leu Tyr Val Ser Leu Met Leu Leu Tyr 








A 0 C 






490 


495 


Lys 


Thr 


Tyr Gly 


Trp 


Lys 


Leu 


His Leu Tyr Ser 


His Pro He He Leu 






500 








505 


510 


Gly Phe 


Arg Lys 


lie 


Pro 


Met 


Gly Val Gly Leu 


Ser Pro Phe Leu Leu 






515 








520 


525 


Ala 


Gin 


Phe Thr 


Ser 


Ala 


He 


Cys Ser Val Val 


Arg Arg Ala Phe Pro 




530 








53 5 




540 


His 


Cys 


Leu Ala 


Phe 


Ser 


Tyr 


Met Asp Asp Val 


Val Leu Gly Ala Lys 


545 








ccft 




555 


560 


Ser 


Val 


bill HIS 


Leu 


blU 


C A V 

£>er 


Leu Tyr Thr Ala 


Val Thr Asn Phe Leu 








DO 3 






570 


575 


Leu 


Ser 


Leu Gly 


lie 


HIS 


Leu 


Asn Pro Asn Lys 


Thr Lys Arg Trp Gly 






580 








585 


590 


Tyr Ser 


Leu Asn 






v»iy 


Tyr Val He Gly Ser Trp Gly Thr Leu 














600 


605 


Pro 


Gin 


ASp HIS 


Tl A 

j.xe 


vai 


win 


Lys He Lys His 


Cys Phe Arg Lys Leu 




610 








O ID 




620 


Pro 


Val 


Asn Arg 


Fro 


lie 


Asp 


Trp Lys Val Cys 


Gin Arg Leu Val Gly 


625 












635 


640 


Leu 


Leu 


Giy pne 




Aia 


Pro 


Phe Thr Gin Cvs 


Gly Tyr Pro Ala Leu 






On j 






650 


655 


Met 


Pro 


Leu 1 yr 


Ala 


cys 


lie 


Gin Ala Lys Gin 


Ala Phe Thr Phe Ser 






£ £ n 

DO U 








665 


670 


Pro 


Thr 


Tyr Lys 


Ala 


ftie 


Leu 


Ser Lys Gin Tyr 


Met Asn Leu Tyr Pro 






O / J 








660 


685 


Val 


Ala 




Arg 




Gly 


Leu Cys Gin Val 


Phe Ala Asp Ala Thr 




690 








6 95 




700 


Pro 


Thr 


Gxy Trp 


m v 
uiy 


T .011 


ilia 


He Gly His Gin Arg Met Arg Glu Thr 


705 








710 




715 


720 


Phe 


Val 


Ala Pro 


Leu 


Pro 


He 


His Thr Ala Glu 


Leu Leu Ala Ala Cys 








725 






730 


735 


Phe 


Ala 


Arg Ser 


Arg 


Ser 


Gly 


Ala Lys Leu He 


Gly Thr Asp Asn Ser 






740 








745 


750 


Val 


Val 


Leu Ser 


Gin 


Lys 


Tyr 


Thr Ser Phe Pro 








755 








760 





(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1663 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Gly Pro Thr Ser Gly Ser Gin Leu Leu Val Leu Leu Leu Leu Leu 

15 10 15 

Ala Ser Ser Leu Leu Ala Leu Gly Ser Pro Met Tyr Ser lie lie Thr 

20 25 30 

Pro Asn Val Leu Arg Leu Glu Ser Glu Glu Thr Phe lie Leu Glu Ala 

35 40 45 

His Asp Ala Gin Gly Asp Val Pro Val Thr Val Thr Val Gin Asp Phe 

50 55 60 

Leu Lys Lys Gin Val Leu Thr Ser Glu Lys Thr Val Leu Thr Gly Ala 
65 70 75 80 

Thr Gly His Leu Asn Arg Val Phe lie Lys He Pro Ala Ser Lys Glu 

85 90 95 

Phe Asn Ala Asp Lys Gly His Lys Tyr Val Thr Val Val Ala Asn Phe 

100 105 110 

Gly Ala Thr Val Val Glu Lys Ala Val Leu Val Ser Phe Gin Ser Gly 

115 120 125 

Tyr Leu Phe He Gin Thr Asp Lys Thr He Tyr Thr Pro Gly Ser Thr 

130 135 140 

Val Phe Tyr Arg He Phe Thr Val Asp Asn Asn Leu Leu Pro Val Gly 
145 150 155 160 

Lys Thr Val Val He Val He Glu Thr Pro Asp Gly Val Pro He Lys 

165 170 175 

Arg Asp He Leu Ser Ser His Asn Gin Tyr Gly He Leu Pro Leu Ser 

180 185 190 

Trp Asn He Pro Glu Leu Val Asn Met Gly Gin Trp Lys He Arg Ala 

195 200 205 

Phe Tyr Glu His Ala Pro Lys Gin Thr Phe Ser Ala Glu Phe Glu Val 

210 2i5 220 

Lys Glu Tyr Val Leu Pro Ser Phe Glu Val Leu Val Glu Pro Thr Glu 
225 230 235 240 

Lys Phe Tyr Tyr He His Gly Pro Lys Gly Leu Glu Val Ser He Thr 

245 250 255 

Ala Arg Phe Leu Tyr Gly Lys Asn Val Asp <5ly Thr Ala Phe Val He 

260 265 270 

Phe Gly Val Gin Asp Glu Asp Lys Lys He Ser Leu Ala Leu Ser Leu 

275 280 285 

Thr Arg Val Leu He Glu Asp Gly Ser Gly Glu Ala Val Leu Ser Arg 

290 295 300 

Lys Val Leu Met Asp Gly Val Arg Pro Ser Ser Pro Glu Ala Leu Val 
305 310 315 320 

Gly Lys Ser Leu Tyr Val Ser Val Thr Val He Leu His Ser Gly Ser 

325 330 335 

Asp Met Val Glu Ala Glu Arg Ser Gly He Pro He Val Thr Ser Pro 

340 345 350 

Tyr Gin He His Phe Thr Lys Thr Pro Lys Phe Phe Lys Pro Ala Met 

355 360 365 

Pro Phe Asp Leu Met Val Phe Val Thr Asn Pro Asp Gly Ser Pro Ala 

370 375 380 

Arg Arg Val Pro Val Val Thr Gin Gly Ser Asp Ala Gin Ala Leu Thr 
385 390 395 400 

Gin Asp Asp Gly Val Ala Lys Leu Ser Val Asn Thr Pro Asn Asn Arg 

405 410 415 

Gin Pro Leu Thr He Thr Val Ser Thr Lys Lys Glu Gly He Pro Asp 

420 425 430 

Ala Arg Gin Ala Thr Arg Thr Met Gin Ala Gin Pro Tyr Ser Thr Met 

435 440 445 

His Asn Ser Asn Asn Tyr Leu His Leu Ser Val Ser Arg Val Glu Leu 

450 455 460 

Lys Pro Gly Asp Asn Leu Asn Val Asn Phe His Leu Arg Thr Asp Ala 
465 470 475 480 

Gly Gin Glu Ala Lys He Arg Tyr Tyr Thr Tyr Leu Val Met Asn Lys 

485 490 495 

Gly Lys Leu Leu Lys Ala Gly Arg Gin Val Arg Glu Pro Gly Gin Asp 

S00 505 510 

Leu Val Val Leu Ser Leu Pro He Thr Pro Glu Phe lie Pro Ser Phe 

515 520 525 

Arg Leu Val Ala Tyr Tyr Thr Leu He Gly Ala Asn Gly Gin Arg Glu 
530 535 540 
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Val Val Ala Asp Ser Val Trp Val Asp Val Lys Asp Ser Cys Val Gly 
545 550 555 560 

Thr Leu Val Val Lys Gly Asp Pro Arg Asp Asn Arg Gin Pro Ala Pro 

565 570 575 

Gly His Gin Thr Thr Leu Arg lie Glu Gly Asn Gin Gly Ala Arg Val 

580 585 590 

Gly Leu Val Ala Val Asp Lys Gly Val Phe Val Leu Asn Lvs Lys Asn 

595 600 605 

Lys Leu Thr Gin Ser Lys lie Trp Asp Val Val Glu Lys Ala Asp lie 

610 615 620 

Gly Cys Thr Pro Gly Ser Gly Lys Asn Tyr Ala Gly Val Phe Met Asp 
625 630 635 640 

Ala Gly Leu Thr Phe Lys Thr Asn Gin Gly Leu Gin Thr Asp Gin Arg 

645 650 655 

Glu Asp Pro Glu Cys Ala Lys Pro Ala Ala Arg Arg Arg Arg Ser Val 

660 665 670 

Gin Leu Met Glu Arg Arg Met Asp Lys Ala Gly Gin Tyr Thr Asp Lys 

675 680 685 

Gly Leu Arg Lys Cys Cys Glu Asp Gly Met Arg Asp lie Pro Met Pro 

690 695 700 

Tyr Ser Cys Gin Arg Arg Ala Arg Leu He Thr Gin Gly Glu Ser Cys 
705 710 715 720 

Leu Lys Ala Phe Met Asp Cys Cys Asn Tyr He Thr Lys Leu Arg Glu 

725 730 735 

Gin His Arg Arg Asp His Val Leu Gly Leu Ala Arg Ser Asp Val Asp 

740 745 750 

Glu Asp He He Pro Glu Glu Asp lie He Ser Arg Ser His Phe Pro 

755 760 765 

Glu Ser Trp Leu Trp Thr He Glu Glu Leu Lys Glu Pro Glu Lys Asn 

770 775 780 

Gly He Ser Thr Lys Val Met Asn He Phe Leu Lys Asp Ser He Thr 
785 790 795 800 

Thr Trp Glu He Leu Ala Val Ser Leu Ser Asp Lys Lys Gly He Cys 

805 810 815 

Val Ala Asp Pro Tyr Glu He Thr Val Met Gin Asp Phe Phe He Asp 

820 825 830 

Leu Arg Leu Pro Tyr Ser Val Val Arg Asn Glu Gin Val Glu He Arg 

835 840 845 

Ala Val Leu Phe Asn Tyr Arg Glu Gin Glu Lys Leu Lys Val Arg Val 

850 855 860 

Glu Leu Leu His Asn Pro Ala Phe Cys Ser Met Ala Thr Ala Lys Lys 
865 870 * 875 880 

Arg Tyr Tyr Gin Thr He Glu He Pro Pro Lys Ser Ser Val Ala Val 

885 890 895 

Pro Tyr Val He Val Pro Leu Lys He Gly Leu Gin Glu Val Glu Val 

900 905 910 

Lys Ala Ala Val Phe Asn His Phe He Ser Asp Gly Val Lys Lys He 

915 920 925 

Leu Lys Val Val Pro Glu Gly Met Arg Val Asn Lys Thr Val Ala Val 

930 935 940 

Arg Thr Leu Asp Pro Glu His Leu Asn Gin Gly Gly Val Gin Arg Glu 
945 950 955 960 

Asp Val Asn Ala Ala Asp Leu Ser Asp Gin Val Pro Asp Thr Asp Ser 

965 970 975 

Glu Thr Arg He Leu Leu Gin Gly Thr Pro Val Ala Gin Met Ala Glu 

980 985 990 

Asp Ala Val Asp Gly Glu Arg Leu Lys His Leu He Val Thr Pro Ser 

995 1000 1005 

Gly Cys Gly Glu Gin Asn Met He Gly Met Thr Pro Thr Val He Ala 

1010 1015 1020 

Val His Tyr Leu Asp Gin Thr Glu Gin Trp Glu Lys Phe Gly Leu Glu 
025 1030 1035 1040 

Lys Arg Gin Glu Ala Leu Glu Leu He Lys Lys Gly Tyr Thr Gin Gin 

1045 1050 1055 

Leu Ala Phe Lys Gin Pro He Ser Ala Tyr AI Ala Phe Asn Asn Arg 

1060 1065 1070 

Pro Pro Ser Thr Trp Leu Thr Ala Met Trp Ser Arg Ser Phe Ser Leu 

1075 1080 1085 

Ala Ala Asn Leu He Ala He Asp Ser Gin Val Leu Cys Gly Ala Val 
1090 1095 1100 
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Lys Trp Leu lie Leu Glu Lys Gin Lys Pro Asp Gly Val Phe Gin Glu 
105 1110 1115 1120 

Asp Gly Pro Val lie His Gin Glu Met lie Gly Gly Phe Arg Asn Thr 

1125 1130 1135 

Lys Glu Ala Asp Val Ser Leu Thr Ala Phe Val Leu lie Ala Leu Gin 

1140 1145 1150 

Glu Ala Arg Asp He Cys Glu Gly Gin Val Asn Ser Leu Pro Gly Ser 

1155 1160 1165 

He Asn Lys Ala Gly Glu Tyr Leu Glu Ala Ser Tyr Leu Asn Leu Gin 

1170 1175 1180 

Arg Pro Tyr Thr Val Ala He Ala Gly Tyr Ala Leu Ala Leu Met Asn 
185 1190 1195 1200 

Lys Leu Glu Glu Pro Tyr Leu Thr Lys Phe Leu Asn Thr Ala Lys Asp 

1205 1210 1215 

Arg Asn Arg Trp Glu Glu Pro Gly Gin Gin Leu Tyr Asn Val Glu Ala 

1220 1225 1230 

Thr Ser Tyr Ala Leu Leu Ala Leu Leu Leu Leu Lys Asp Phe Asp Ser 

1235 1240 1245 

Val Pro Pro Val Val Arg Trp Leu Asn Asp Glu Arg Tyr Tvr Gly Gly . 

1250 1255 1260 

Gly Tyr Gly Ser Thr Gin Ala Thr Phe Met Val Phe Gin Ala Leu Ala 
265 1270 1275 1280 

Gin Tyr Arg Ala Asp Val Pro Asp His Lys Asp Leu Asn Met Asp Val 

1285 1290 1295 

Ser Leu His Leu Pro Ser Arg Ser Ser Pro Thr Val Phe Arg Leu Leu 

1300 1305 1310 

Trp Glu Ser Gly Ser Leu Leu Arg Ser Glu <51u Thr Lys Gin Asn Glu 

1315 1320 1325 

Gly Phe Ser Leu Thr Ala Lys Gly Lys Gly Gin Gly Thr Leu Ser Val 

1330 1335 1340 

Val Thr Val Tyr His Ala Lys Val Lys Gly Lys Thr Thr Cys Lys Lys 
345 1350 1355 1360 

Phe Asp Leu Arg Val Thr He Lys Pro Ala Pro Glu Thr Ala Lys Lys 

1365 1370 1375 

Pro Gin Asp Ala Lys Ser Ser Met He Leu Asp He Cys Thr Arg Tyr 

1380 1385 1390 

Leu Gly Asp Val Asp Ala Thr Met Ser He Leu Asp He Ser Met Met 

1395 1400 1405 

Thr Gly Phe He Pro Asp Thr Asn Asp Leu <31u Leu Leu Ser Ser Gly 

1410 1415 1420 

Val Asp Arg Tyr He Ser Lys Tyr Glu Met Asp Lys Ala Phe Ser Asn 
425 1430 1435 1440 

Lys Asn Thr Leu He He Tyr Leu Glu Lys He Ser His Ser Glu Glu 

1445 1450 1455 

Asp Cys Leu Ser Phe Lys Val His Gin Phe Phe Asn Val Gly Leu He 

1460 1465 1470 

Gin Pro Gly Ser Val Lys Val Tyr Ser Tyr Tyr Asn Leu Glu Glu Ser 

1475 1480 1485 

Cys Thr Arg Phe Tyr His Pro Glu Lys Asp Asp Gly Met Leu Ser Lys 

1490 1495 1500 

Leu Cys His Asn Glu Met Cys Arg Cys Ala Glu Glu Asn Cvs Phe Met 
505 1510 1515 " 1520 

His Gin Ser Gin Asp Gin Val Ser Leu Asn Glu Arg Leu Asp Lvs Ala 

1525 1530 1535 

Cys Glu Pro Gly Val Asp Tyr Val Tyr Lys Thr Lys Leu Thr Thr He 

1540 1545 1550 

Glu Leu Ser Asp Asp Phe Asp Glu Tyr He Met Thr He Glu Gin Val 

1555 1560 1565 

He Lys Ser Gly Ser Asp Glu Val Gin Ala Gly Gin Glu Arg Arg Phe 

1570 1575 1580 

He Ser His Val Lys Cys Arg Asn Ala Leu Lys Leu Gin Lys Gly Lys 
585 1590 1595 1600 

Gin Tyr Leu Met Trp Gly Leu Ser Ser Asp Leu Trp Gly Glu Lys Pro 

1605 1610 1615 

Asn Thr Ser Tyr He He Gly Lys Asp Thr Trp Val Glu His Trp Pro 

1620 1625 1630 

Glu Ala Glu Glu Arg Gin Asp Gin Lys Asn Gin Lys Gin Cys Glu Asp 

1635 1640 1645 

Leu Glv Ala Phe Thr Glu Thr Met Val Val Phe Gly Cys Pro Asn 
1650 1655 1660 
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(2) INFORMATION FOR SEQ ID NO: 21; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1666 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 





(vi) ORIGINAL 


SOURCE : 


















(xi) i 


SEQUENCE 


DESCRIPTION 


: SEQ ID NO:! 


21: 








Met 


L»iy 


Pro 


Ala 


ilia 




Dm 


Car 
OC X 


T.oi 1 


T .01 i T .01 1 


Leu 


Leu 


Leu 


Ala Ser 


1 






'c 








10 








15 


Val 


COT" 


Leu 


nla 


Leu 


\j J. y 


7\ cn 


nu 




1 y t o ex 


He 


He 


Thr 


Pro Asn 


























30 




He 


T n* * 

Leu 


Arg 


Leu 


G1U 


Asn 


U1U 


bill 


inr 


\7=i 1 17a 1 

vai vai 


Leu 


Glu 


Ala 


His Glu 






35 










4 ft 
4 0 








45 






Val 


uin 


Gly 


Asp 


lie 


Pro 




mr 


t/al 


inr vai 


His Asp Phe 


Pro Ala 




50 


















60 








Lys 


Lys 


Asn 


Val 


Leu 


Ser 


Ser 


Glu 


Lys 


Thr Val 


Leu 


Thr 


Ser 


Ala Thr 


65 








70 








/ b 








80 


Gly 


Tyr 


Leu 


Gly 


Thr 


Val 


Thr 


lie 


Lys 


lie Pro 


Ala Ser Lys 


Glu Phe 














qn 








95 


Lys 


Ser 


Asp 


Lys 


Gly 


Arg 


Lys 


Leu 


Val 


Val Val 


Gin 


Ala 


Ala 


Phe Gly 








inn 
100 










1 ft c 








110 




Gly 


Thr 


Gin 


Leu 


Glu 


Lys 


val 


vai 


T nil 

Leu 


vai ser 


Leu 


Gin 


Ser 


Gly Tyr 






tic 

115 










0 








125 






Leu 


Pne 


lie 


Gin 


Thr 


Asp 


Lys 


i nr 


lie 


Tyr Thr 


Pro Gly Ser Thr Val 




i ^ n 


















140 








Leu 


Tyr 


Arg 


lie 


Pne 


Thr 


vai 


Asp 


Ser 


Asp Leu 


Leu 


Pro 


Val 


Gly Arg 


145 










1 c ft 








ice: 








160 


Thr 


lie 


lie 


vai 


Tnr 


lie 




inr 


Pro 




He 


Pro 


He 


Lys Arg 










ICC 

lbs 










1 I \J 








175 


Asp 


Tnr 


Leu 


C a v 

ber 


ber 


Asn 


Asn 


uin 


Hi c 


vjiy lie 


Leu 


Pro 


Leu 


Ser Trp 






1 a n 
180 


















190 




Asn 


lie 


Pro 


Glu 


Leu 


vai 


Asn 


Mec 


vjiy 


_ 

Gin Trp 


Lys 


lie 


Gin 


Ala Phe 






195 










inn 
ZUU 








205 






Tyr 


Glu 


Asn 


Ser 


Pro 


Lys 


uin 


v a j. 


fne 


C*a>- Ala 
del Ala 


Glu 


Phe 


Glu 


Val Lys 


210 


















220 








Glu 


Tyr 


Val 


Leu 


Pro 


Ser 


nl,. 


G1U 


^/ai 


L>eu vai 


Glu 


Pro 


Thr 


Glu Lys 


225 








i J 0 
















240 


Phe 


Tyr 


Tyr 


lie 


Aso 


Asp 


rl (J 


T vc 


vjx y 


T .01 1 f3 1 1 1 
IJCU ox u 


Val 


Asn 


He 


He Ala 






24*5 










250 








255 


Arg 


Phe 


Leu 


Tyr 


Gly 


Lys 


Asn 


Val 


Asp 


Gly Thr 


Ala 


Phe 


Val 


He Phe 






260 










265 








270 




Gly 


Val 


Gin 


Asp 


Gly 


Asp 


Gin 


Arg 


lie 


Ser Leu 


Ala 


Gin 


Ser 


Leu Thr 






275 










280 








285 






Arg 


Val 


Val 


He 


Glu 


Asp 


Gly 


Ser 


Gly 


Glu Val 


Val 


Leu 


Ser 


Arg Gin 


290 










295 








300 








Val 


Leu 


Leu 


Asp 


Gly 


Val 


Gin 


Pro 


Ser 


Arg Pro 


Glu 


Ala 


Leu 


Val Gly 


305 








310 








315 








320 


Lys 


Ser 


Leu 


Tyr 


Val 


Ser 


Val 


Thr 


Val 


He Leu 


His 


Ser Gly 


Ser Asp 






325 










330 








335 


Met 


Val 


Glu 


Ala 


Glu 


Arg 


Ser 


Gly 


He 


Pro He 


Val 


Thr 


Ser 


Pro Tyr 








340 










345 








350 




Gin 


He 


His 


Phe 


Thr 


Lys 


Thr 


Pro 


Lys 


Tyr Phe 


Lys 


Pro 


Ala 


Met Pro 






355 








360 








365 






Phe 


Glu 


He 


Met 


val 


Leu 


Val 


Thr 


Asn 


Pro Asp 


Gly Ser Pro 


Ala Pro 




370 










375 








380 








His 


Val 


Pro 


Val 


Val 


Thr 


Gin 


Gly 


Ser 


Asn Val 


Gin 


Ser 


Leu 


Thr Gin 


365 










390 








395 








400 


Ala 


Asp 


Gly 


Val 


Ala 


Arg 


Leu 


Ser 


He 


Asn Thr 


Pro Asn Thr Arg Gin 






405 










410 








415 


Pro 


Leu 


Ser 


Val 


Thr 


Val 


Gin 


Thr 


Lvs 


Lys Gly 


Gly He Pro Asp Ala 








420 










425 








430 
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Arg 


Gin 


Ala 


He 


Asn 


Thr 


Met 


Gin 




435 










440 


Asn 


Ser 


Asn 


Asn 


Tyr 


Leu 


His 


Leu 




450 










455 




Pro 


Gly 


Glu 


Thr 


He 


Asn 


Val 


Asn 


465 










470 






Gin 


Glu 


Ala 


Lys 


He 


Arg 


Tyr 


Tyr 










485 








Lys 


Leu 


Leu 


Lys 


Val 


Gly 


Arg 


Gin 








500 










Val 


Val 


Leu 


Pro 


Met 


Pro 


He 


Thr 






515 










520 


Leu 


Val 


Ala 


Tyr 


Tyr 


Thr 


Leu 


He 




530 










535 




Val 


Ala 


Asp 


Ser 


Val 


Trp 


Ala 


Asp 


545 










550 






Leu 


Val 


Val 


Lys 


Gly 


Gly 


Ser 


Gly 










565 








Gin 


His 


Leu 


Pro 


Arg 


Gin 


Gin 


Met 








580 










Gly 


Ala 


Arg 


Val 


Gly 


Leu 


Val 


Ala 




595 










600 


Asn 


Lys 


Lys 


His 


Lys 


Leu 


Thr 


Gin 




610 










615 




Lys 


Ala 


Asp 


He 


Gly 


Cys 


Thr 


Pro 


625 










630 






Val 


Phe 


Thr 


Asp 


Ala 


Gly 


Leu 


Ser 










645 








Gin 


Thr 


Ala 


Gin 


Arg 


Glu 


Gly 


Leu 








660 










Arg 


Arg 


Arg 


Ser 


Val 


Gin 


Leu 


Met 






675 










680 


Lys 


Tyr 


Lys 


Ser 


Lys 


Glu 


Leu 


Arg 




690 










695 




Glu 


Asn 


Pro 


Met 


Gin 


Phe 


Ser 


Cys 


705 










710 






Leu 


Gly 


Glu 


Ala 


Cys 


Val 


Lys 


Ala 








725 








Ala 


Gin 


Leu 


Arg 


Gin 


Gin 


His 


Arg 








740 










Arg 


Ser 


Asp 


Met 


Asp 


Glu 


Asp 


lie 




755 










760 


Arg 


Ser 


Gin 


Phe 


Pro 


Glu 


Ser 


Trp 


770 










775 




Glu 


Pro 


Glu 


Arg 


Asn 


Gly 


He 


Ser 


7B5 










790 






Lys 


Asp 


Ser 


He 


Thr 


Thr 


Trp 


Glu 










805 








Lys 


Lys 


Gly 


He 


Cys 


Val 


Ala 


Asp 








820 










Asp 


Phe 


Phe 


He 


Asp 


Leu 


Arg 


Leu 




835 










840 


Gin 


Val 


Glu 


He 


Arg 


Ala 


Val 


Leu 




850 










855 




Leu 


Lys 


Val 


Arg 


Val 


Glu 


Leu 


Leu 


865 










870 






Ala 


Thr 


Ala 


Lys 


Lys 


Arg 


His 


Thr 










885 








Ser 


Ser 


Val 


Ala 


Val 


Pro 


Tyr 


Val 








900 










Gin 


Glu 


Val 


Glu 


Val 


Lys 


Ala 


Ala 






915 










920 


Gly 


Val 


Lys 


Lys 


Thr 


Leu 


Lys 


Val 


930 










935 




Lys 


Thr 


Val 


Ala 


He 


Arg 


Thr 


Leu 


945 










950 






Gly 


Val 


Gin 


Arg 


Glu 


Glu 


He 


Pro 








965 








Pro 


Asp 


Thr 


Asp 


Ser 


Glu 


Thr 


Lys 








980 











•75- 



Ala 


Leu 


Pro 


Tyr 


Thr 


Thr 


Met 


Tyr 










445 








Ser 


Met 


Pro 


Arg 


Thr 


Glu 


Leu 


Lys 








460 










Phe 


His 


Leu 


Arg 


Ser Asp 


Pro 


Asn 






475 










480 


Thr 


Tyr 


Leu 


He 


Met 


Asn 


Lys 


Gly 




490 










495 




Pro 


Arg 


Glu 


Pro Gly Gin Ala 


Leu 


505 










510 






Lys 


Glu 


Leu 


He 


Pro 


Ser 


Phe 


Arg 










525 








Gly 


Ala 


Ser 


Ala Gin Arg Glu 


Val 








540 










Val 


Arg 


Asp 


Ser 


Cys 


Val 


Gly 


Thr 






555 










560 


Lys 


Asp 


Gly 


Gin Asp Lys Arg 


Gin 




570 










575 




Thr 


Leu 


Arg 


He Glu Gly Asn 


Gin 


585 










590 






Val 


Asp 


Lys 


Gly Val 


Phe 


Val 


Leu 










605 








Ser 


Lys 


He 


Trp Asp 


Val 


Val 


Glu 








620 










Gly 


Ser 


Gly 


Lys Asp Tyr Ala 


Gly 






635 










640 


Phe 


Lys 


Ser 


Ser Lys Ala Gly 


Leu 




650 










655 




Asp 


Cys 


Pro 


Lys 


Pro 


Ala 


Ala 


Arg 


665 










670 






Glu 


Arg 


Arg 


Met Asp Lys Ala 


Gly 










685 








Arg 


Cys 


Cys 


Glu Asp Gly Met 


Arg 








700 










Gin 


Arg 


Arg 


Ala Arg Tyr Val 


Ser 






715 










720 


Phe 


Leu 


Asp 


Cys Cys Thr Tyr 


Met 




730 










735 




Arg 


Glu 


Gin 


Asn Leu Gly Leu 


Ala 


745 










750 






He 


Pro 


Glu 


Glu Asp 


He 


He 


Ser 










765 








Leu 


Trp 


Thr 


He 


Glu 


Glu 


Leu 


Lys 








780 










Thr 


Lys 


Thr 


Met 


Asn 


lie 


Phe 


Leu 




795 










800 


He 


Leu 


Ala 


Val 


Ser 


Leu 


Ser 


Asp 




810 










815 




Pro 


Phe 


Glu 


Val 


Thr 


Val 


Met 


Gin 


825 










830 






Pro 


Tyr 


Ser 


Val 


Val 


Arg 


Asn 


Glu 








845 








Tyr 


Asn 


Tyr 


Arg 


Glu 


Ala 


Gin 


Ser 








860 










His 


Asn 


Pro 


Ala 


Phe 


Cys 


Ser 


Leu 






875 










880 


Gin 


Thr 


Val 


Thr 


He 


Gly 


Pro 


Lys 




890 










895 




Leu 


Val 


Pro 


Leu 


Lys 


He 


Gly 


Leu 


905 










91C 






Val 


Tyr 


Asn 


Tyr 


Phe 


He 


Ser 


Asp 










925 








Val 


Pro 


Glu 


Gly Met Arg 


Val 


Asn 








940 










Asn 


Pro 


Glu 


Gin Leu Gly Gin 


Gly 






955 










960 


Ala 


Ala 


Asp 


Leu 


Ser 


Asp 


Gin 


Val 




970 








975 




He 


Leu 


Leu 


Gin Gly Thr 


Pro 


Val 


985 










990 
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Ala Gin Met Ala Glu Asp Ala Val Asp Ala Glu Arg Leu Lys His Leu 

995 1000 1005 

He lie Thr Pro Ser Gly Cys Gly Glu Gin Asn Met He Gly Met Thr 

1010 1015 1020 

Pro Thr Val He Ala Val His Tyr Leu Asp Gin Thr Glu Gin Trp Glu 
025 1030 1035 1040 

Lys Phe Gly Leu Glu Lys Arg Gin Glu Ala Leu Asn Leu He Asn Arg 

1045 1050 1055 

Gly Tvr Thr Gin Gin Leu Ala Phe Lys Gin Pro Asn Trp Ala Tyr Ala 

1060 1065 1070 

Ala Phe Lys Asn Arg Ala Ser Ser Thr Trp Leu Thr Ala Tyr Val Val 

1075 1080 1085 

Lys Val Phe Ser Leu Ala Ala Asn Leu He Gly He Asp Ser Glu Val 

1090 1095 1100 

Leu Cys Gly Ala Val Lys Trp Leu He Leu Glu Lys Gin Lys Pro Asp 
105 1110 1115 1120 

Gly Val Phe Gin Glu Asd Gly Pro Val He His Gin Glu Met He Gly 

1125 " 1130 1135 

Gly Val Arg Thr Ala Gin Glu Ala Asp Val Ser Leu Thr Ala Phe Val 

1140 1145 1150 

Leu He Ala Leu Gin Glu Ala Lys Asp He Cys Arg Ala Gin Val Asn 

1155 1160 1165 

Asn Leu Glu Ala Asn He Asn Lys Ala Gly Asp Tyr He Glu Ser Arg 

1170 H75 1180 

Tyr Ala Asp Val Arg Arg Pro Tyr Thr Leu Ala He Ala Gly Tyr Ala 
185 1190 1195 1200 

Leu Ala Leu Leu Glu Arg Leu Asn Gly Ala Thr Leu Gin Lys Phe Leu 

1205 1210 1215 

Asn Ala Ala Thr Glu Lys Asn Arg Trp Glu Glu Ala Arg Gin Lys Leu 

1220 1225 1230 

Tyr Ser Val Glu Ala Thr Ser Tyr Ala Leu Leu Ala Leu Leu Leu Leu 

1235 1240 1245 

Lys Asp Phe Asp Ala Val Pro Pro Val Val Arg Trp Leu Asn Glu Gin 

1250 1255 1260 

Arg Tyr Tyr Gly Arg Gly Tyr Gly Ser Thr Gin Ala Thr Phe Met Val 
265 1270 1275 1280 

Phe Gin Ala Leu Ala Gin Tyr Gin Thr Asp Val Pro Asp His Lys Asp 

1285 1290 1295 

Leu Asn Met Glu Val Ala Leu Gin Leu Pro Ser Arg Ser Ser Pro Ser 

1300 1305 1310 

Lys Phe Arg Leu Val Trp Glu Ala Gly Ser Leu Leu Arg Ser Glu Ala 

1315 1320 1325 

Thr Lys Gin Asn Glu Gly Phe Lys Leu Thr Ala Lys Gly Lys Gly Gin 

1330 1335 1340 

Gly Thr Leu Ser Val Val Ala Val Tyr Tyr Ala Lys Thr Lys Arg Lys 
345 1350 1355 1360 

Val Val Cys Lys Asn Phe Asp Leu Arg Val Thr Leu Lys Pro Ala Pro 

1365 1370 1375 

Asp Thr Val Lys Lys Pro Gin Glu Ala Lys Ser Thr Met He Leu Gly 

1380 1385 1390 

He Cys Thr Arg Tyr Leu Gly Asp Gin Asp Ala Thr Met Ser He Leu 

1395 1400 1405 

Asp He Ser Met Met Thr Gly Phe He Pro Asp Thr Asp Asp Leu Lys 

1410 1415 1420 

Leu Leu Ala Thr Gly Val Asp Arg Tyr He Ser Lys Tyr Glu Met Asn 
425 1430 1435 1440 

Lys Aso Phe Ser Lvs Asn Thr Leu He He Tyr Leu Asp Lys Val Ser 

1445 1450 1455 

His Ser Glu Glu Glu Cys Leu Ser Phe Lys lie His Gin Phe Phe Asn 

1460 1465 1470 

Val Gly Leu He Gin Pro Gly Ser Val Lys Val Tyr Ser Tyr Tyr Asn 

1475 1480 1485 

Leu Asp Glu Thr Cys Thr Gin Phe Tyr His Pro Glu Lys Glu Asp Gly 

1490 1495 1500 

Met Leu Asn Lys Leu Cys His Lys Asp Leu Cys Arg Cys Ala Glu Glu 
505 1510 1515 1520 

Asn Cys Phe He Gin Leu Pro Glu Lys He Thr Leu Asp Glu Arg Leu 

1525 1530 1535 

Glu Lvs Ala Cvs Glu Pro Gly Val Asp Tyr Val Tyr Lys Thr Lys Leu 
15^0 1545 1550 
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Leu Lys Met Glu Leu Ser Asp Asp Phe Asp Glu Tyr lie Met Thr lie 

1555 1560 1565 

Glu Gin Val lie Lvs Ser Gly Ser Asp Glu Val Gin Ala Gly Lys Glu 

1570 " 1575 1580 

Arg Arg Phe He Ser His He Lys Cys Arg Asp Ala Leu His Leu Lys 
585 1590 1595 1600 

Glu Gly Lys His Tyr Leu Met Trp Gly Leu Ser Ser Asp Leu Trp Gly 

1605 1610 1615 

Glu Arg Pro Asn Met Ser Tyr He He Gly Lys Asp Thr Trp Val Glu 

1620 " 1625 1630 

Ala Trp Pro Glu Ala Glu Glu Cys Gin Asp Glu Glu Asn Gin Gin Gin 

1635 1640 1645 

Cys Gin Asp Leu Gly Thr Phe Thr Glu Asn Met Val Val Phe Gly Cys 

1650 ' 1655 1660 

Pro Asn 
665 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1015 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(IV) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



Phe 


Phe 


Arg 


Glu 


Asp 


Leu 


Ala 


Phe 


Leu 


Gin 


Gly 


Lys 


Ala 


Arg 


Glu 


Phe 


1 






5 










10 










15 




Ser 


Ser 


Glu 


Gin 


Thr 


Arg 


Ala 


Asn 


Ser 


Pro 


Thr 


He 


Ser 


Ser 


Glu 


Gin 








20 








25 










30 






Thr 


Arg 


Ala 


Asn 


Ser 


Pro 


Thr 


Arg 


Arg 


Glu 


Leu 


Gin 


Val 


Trp 


Gly 


Arg 




35 










40 










45 








Asp 


Asn 


Asn 


Ser 


Pro 


Ser 


Glu 


Ala 


Gly 


Ala 


Asp 


Arg 


Gin 


Gly 


Thr 


Val 


50 










55 










60 










Ser 


Phe 


Asn 


Phe 


Pro 


Gin 


He 


Thr 


Leu 


Trp 


Gin 


Arg 


Pro 


Leu 


Val 


Thr 


65 










70 










75 










80 


He 


Lys 


He 


Gly 


Gly 


Gin 


Leu 


Lys 


Glu 


Ala 


Leu 


Leu 


Asp 


Thr 


Gly 


Ala 






85 










90 










95 




Asp 


Asp 


Thr 


Val 


Leu 


Glu 


Glu 


Met 


Ser 


Leu 


Pro 


Gly 


Arg 


Trp 


Lys 


Pro 




100 










105 










110 






Lys 


Met 


He 


Gly 


Gly 


He 


Gly 


Gly 


Phe 


lie 


Lys 


val 


Arg 


Gin 


Tyr 


Asp 




115 








120 










125 








Gin 


He 


Leu 


He 


Glu 


He 


Cys 


Gly 


His 


Lys 


Ala 


lie 


Gly 


Thr 


Val 


Leu 




130 










135 










140 








Gin 


Val 


Gly 


Pro 


Thr 


Pro 


Val 


Asn 


He 


He 


Gly 


Arg 


Asn 


Leu 


Leu 


Thr 


145 








150 










155 










160 


He 


Gly 


Cys 


Thr 


Leu 


Asn 


Phe 


Pro 


lie 


Ser 


Pro 


He 


Glu 


Thr 


Val 


Pro 






165 










170 










175 




Val 


Lys 


Leu 


Lys 


Pro 


Gly 


Met 


Asp 


Gly 


Pro 


Lys 


Val 


Lys 


Gin 


Trp 


Pro 






180 










185 










190 






Leu 


Thr 


Glu 


Glu 


Lys 


lie 


Lys 


Ala 


Leu 


Val 


Glu 


He 


Cys 


Thr 


Glu 


Met 






195 








200 










205 








Glu 


Lys 


Glu 


Gly 


Lys 


Il« 




I.ys 


He 


Gly 


Pro 


Glu 


Asn 


Pro 


- J — 


Asn 




2*10 








215 










220 








Lys 


Thr 


Pro 


Val 


Phe 


Ala 


He 


Lys 


Lys 


Lys 


Asp 


Ser 


Thr 


Lys 


Trp 


Arg 


225 










230 










235 










240 


Leu 


Val 


Asp 


Phe 


Arg 


Glu 


Leu 


Asn 


Lys 


Arg 


Thr 


Gin 


Asp 


Phe 


Trp 


Glu 






245 










250 










255 




Val 


Gin 


Leu 


Gly 
260 


He 


Pro 


His 


Pro 


Ala 
265 


Gly 


Leu 


Lys 


Lys 


Lvs 
270 


Lys 


Ser 


Val 


Thr 


Val 


Leu 


Asp 


Val 


Gly 


Asp 


Ala 


Tyr 


Phe 


Ser 


Val 


Pro 


Leu 


Asp 






275 








280 










285 








Glu 


Asp 


Phe 


Arg 


Lys 


Tyr 


Thr 


Ala 


Phe 


Thr 


He 


Pro 


Ser 


lie 


Asn 


Asn 
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290 






295 








300 








Glu 


Thr 


Pro 


Glv 


He Arg Tyr Gin 


Tyr Asn 


Val 


Leu Pro 


Gin Gly Trp 


305 








310 






315 








320 


Lys 


Glv 


Ser 


Pro 


Ala He Phe 


Gin 


Ser Ser 


Met 


Thr Lys 


lie 


Leu 


Glu 






325 




330 








335 




pro 


XrilXZ 


Lys 


Lys 


Gin Asn Pro 


Asp 


Tip Va 1 

A. X C V CLX 


Tip 

X X c 


1 y 1. \3JLiX 


Tyr Met 


Asp 








340 






345 






350 








l*eu 


Tyr 


Vai 


Gly Ser Asp 


Leu 


uX U XX c 


vsx y 


m n H*i e; 

UJ.U nia 


Arg 


Thr 


Lys 






355 






360 






365 








Tip 

x x c 




Glu Leu Arg Gin His 


Leu 


T .01 1 7A ypr 
ucu ax y 


irp 


(7} V T.Oll 

oiy ucu 


Thr 


Thr 


Pro 




370 






375 








380 










T ,vc 


Lys 


His 


Gin Lys Glu 


Pro 


Pro Phe 


Leu 


Trp Met 


Gly Tyr Glu 


385 








390 






395 








400 




Hies 
nxo 


Pro Asp 


Lys Trp Thr 


Vai 


Gin Pro 


He 


Val Leu 


Pro Glu Lys 










405 




410 








415 






Cay 
wCX 


Trp 


Thr 


Vai Asn Asp 


He 




Leu 


Val Gly 


Lys 


Leu 


Asn 








420 






425 






430 






irp 


Ala 
Ala 


Ser 


Gin 


He Tyr Pro Gly 


He Lys 


Val 

V AX 


7A ttt dl ri 


Leu Cys 


Lys 






435 






440 






445 








Leu 


T 

lr€U 


Arg Gly Thr Lys Ala 


Leu 


1 ILL ulU 


Val 


He Pro 


Leu 


Thr 


Glu 










455 
















bill 


Ala 


Glu 


Leu 


Glu Leu Ala 


Glu 


Asn Arg 


ulU 


Xle Leu 


Lys 


Glu 


Pro 


>t £ c 








470 






*k f 3 








480 


vai 


His 


Gly Vai 


Tyr Tyr Asp 


Pro 


Ser Lys 


Asp 


Leu lie 


Ala 


Glu 


He 










485 




a on 








495 




Gill 


Lys 


Gin Gly Gin Gly Gin Trp 


Thr Tyr 


Gin 


lie lyT 


Gin 


Glu 


Pro 








500 












510 






Pne 


Lys 


Asn 


Leu 


Lys Thr Gly Lys 


Tyr Ala 


Arg 


Met Arg 


Gly Ala His 






515 






520 














Tax 


Asn 


Asp 


Vai 


Lys Gin Leu 


Thr 


G1U Ala 


vax 


Gin JjyS 


He 


Thr 


Thr 




530 




535 








RAO 








GIU 


C A V 


He 


Vai 


He Trp Gly Lys 


inr Fro 


Lys 


irne i/ys 


Leu 


Pro 


He 


CAE 








550 






ccc 

33 3 








560 


Gin 


Lys 


Glu Thr Trp Glu Thr Trp 


Trp Txir 


G1U 


lyr irp 


Gin 


Ala 


Thr 










565 




570 
3 / \j 








575 




Trp 


ne 


Pro Glu Trp Glu Phe Vai 


Asn Thr 


Pro 


Di*n T .on 
irxO ucu 


Val 


Lys 


Leu 






580 












590 






Trp 


Tyr 


Gin Leu Glu Lys Glu Pro 


lie vol 


wxy 


Ala. ulU 


Thr 


Phe 


Tyr 


595 






600 






605 








vai 


Asp 


Gly Ala Ala Asn Arg 


Glu 


i nr xiys 


Leu 


Gly Lys 


Ala Gly Tyr 




olU 






615 
















vai 


Thr 


Asn 


Lys 


Gly Arg Gin 


Lys 


vax vox 


Pro 


Leu Thr 


Asn 


Thr 


Thr 


© 






630 






O J 3 








640 


Asn 


Gin 


Lys 


Thr 


Glu Leu Gin 


Ala 


Tin rrv, 

lie lyr 


Leu 


Ala Leu 


Gin 


Asd 


Ser 








645 












655 




Gly 


Leu 


Glu 


Vai 


Asn He Vai 


Thr 


Asp Ser 


Gin 


lyr Aia 


Leu Gly 


He 






660 






665 






670 






Xle 


Gin 


Ala 


Gin 


Pro Asp Lys 


Ser 




(jlu 


Ton \7a 1 
Leu Vax 


Asn 


Gin 


He 






675 






660 






O O W 








lie 


Glu 


Gin 


Leu 


He Lys Lys 


Glu 


ijys vai 


Tyr 


Leu Ala 


Trp 


Val 


Pro 




con 






695 
















Ala 


HIS 


Lys 


Gly 


He Gly Gly Asn 


uiu uin 


VoX 




Leu 


Val 


Ser 


705 






710 






715 








720 


Ala 


Gly 


He 


Arg 


Lys He Leu 


Phe 


T on 7\ en 


iji y 


Tlo A cn 
11c Map 


Lys 


Ala 


Gin 








725 




7^ n 








735 




Asp 


Glu 


His 


Glu 


Lys Tyr His 


Ser 


Asn Trp 


Arg 


21 1 a Mot" 

Ma net 


Ala 


Ser 


Asp 






74 0 






'13 






750 






Pne 


Asn 


Leu 


Pro 


Pro Vai Vai 


Ala 


Lys Glu 


Tip 
lie 


Vai nia 


Ser 


Cys 


Asp 






755 






760 














Lys 


Cys 


Gin 


Leu 


Lys Glv Glu 


Ala 


net his 


pl v 

oiy 


UlU Val 


& cr\ 
• — c 




Ser 


Tin 

/ /U 






V 775 








780 








Pro 


Gly 


He 


Trp 


Gin Leu Asp 


Cys 


inr xixs 


Leu 




Lys 


Val 


lie 


785 




790 






795 








800 


Leu 


Vai 


Ala 


Vai 


His Vai Ala 


Ser 


Gly Tyr 


He 


Glu Ala 


Glu 


Val 


lie 










805 




810 








815 




Pro 


Ala 


Glu 


Thr Gly Gin Glu 


Thr 


Ala Tyr 


Phe 


Leu Leu 


Lys 


Leu 


Ala 








820 






825 






830 






Gly 


Arg 


Trp 


Pro 


Vai Lys Thr 


He 


His Thr 


Asp 


Asn Gly 


Ser 


Asn 


Phe 






B35 






840 






845 








Thr 


Ser 


Ala 


Thr 


Vai Lys Ala 


Ala 


Cys Trp 


Trp 


Ala Gly 


He 


Lys 


Gin 
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Pro 


Tyr 
870 


Asn 


Pro 


Pin 


Ser 


VJ J-Ii 

O / 3 


y 


Val Val Glu Ser 
880 


Leu 


Lys 


Lys 


He 


He 


Gly 


Gin 


Val 


Arg Asp Gin Ala 


885 














895 


Thr 


Ala 


Val 


C 1 n 


rlct 




V CtX 


Ph*=» 
riic 


He His Asn Phe 








one 








910 


Gly 


He 


Gly 


biy 


Tyr 


Ser 


Til 3 


oxy 


m ti Ara He Val 






920 












Thr Asd 


lie 


bin 


x ni 


T \r c 
JUV& 


UlU 


iJCU 


filn Lvs Gin He 






935 










94 0 




Asn 


Phe 
950 


Arg 


Val 


Tyr 


Tyr 


Arg 
955 


Asp 


Ser Arg Asn Pro 
960 


Pro 


Ala 


Lys 


Leu 


Leu 


Trp 


Lys 


Gly 


Glu Gly Ala Val 


965 








970 






975 


Asn 


Ser 


Asp 


He 


Lys 


Val 


Val 


Pro 


Arg Arg Lys Ala 








985 








990 


Asp 


Tyr 


Gly 


Lys 


Gin 


Met 


Ala 


Gly 


Asp Asp Cys Val 


1000 








1005 



850 855 860 

Glu Phe Gly He 
865 

Met Asn Lys Glu 

Glu His Leu Lys 
900 

Lys Arg Lys Gly 
915 

Asp He He Ala 
930 

Thr Lys He Gin 
945 



980 

Lys He He Arg 
995 

Ala Ser Arg Gin Asp Glu Asp 
1010 1015 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 
<iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Thr Glv Gly Phe Phe Arg Asp Trp Pro Leu Gly Lys Glu Ala Pro Gin 

1 5 1° 15 

Phe Pro Arg Gly Pro Ser Ser Thr Gly Ala Asn Thr Asn Ser Thr Pro 

20 25 30 

He Gly Ser Ser Ser Gly Ser Thr Gly Glu lie Tyr Ala Ala Arg Glu 

35 40 45 

Lvs Ala Glu Gly Ala Glu Thr Glu Thr He Gin Arg Gly Asp Arg Gly 

50 55 60 

Leu Thr Ala Pro Arg Thr Arg Arg Gly Pro Met Gin Gly Asp Asn Arg 
65 70 75 80 

Gly Leu Ala Ala Pro Gin Phe Ser Leu Trp Lys Arg Pro Val Val Thr 

85 9° 95 

Ala His He Glu Gly Gin Pro Val Glu Val Leu Leu Asp Thr Gly Ala 

100 HO 
Asd Asp Ser He Val Ala Gly He Glu Leu Gly Ser Asn Tyr Ser Pro 

* !15 120 125 

Lvs He Val Gly Gly He Gly Gly Phe He Asn Thr Lys Glu Tyr Lys 
130 135 140 

Glu He Glu Val Leu Gly Lys Arg Val Arg 
145 150 155 160 

pro He Asn He Phe Gly Arg Asn He Leu Thr 
165 170 I 75 



Asn Val Glu He Glu Val Leu Gly Lys Arg Val Arg Ala Thr He Met 

150 155 loO 

Thr Gly Asp Thr Pro He Asn He Phe Gly Arg Asn lie Leu Thr Ala 

165 170 , 5 

Leu Gly Met Ser Leu Asn Leu Pro Val Ala Lys He Glu Pro He Lys 

180 185 190 

He Met Leu Lys Pro Gly Lys Asp Gly Pro Arg Leu Arg Gin Trp Pro 

195 200 205 

Leu Thr Lys Glu Lys He Glu Ala Leu Lys G . l He Cys Glu Lys Met 

210 215 220 

Glu Lys Glu Gly Gin Leu Glu Glu Ala Pro Pro Thr Asn Pro Tyr Asn 

Thr Pro Thr Phe Ala He Arg Lys Lys Asp Lys Asn Lys Trp Arg Met 
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Leu 


T 1 p» 
lie 


Asp 


Phe 


lira 


Glu 


Leu 


Asn 


Lvs 








260 










265 


lie 


Gin 


Leu 


Glv 


He 


Pro 


His 


Pro 


Ala 






275 










280 




Tip 

lie 


Thr 

J. 111 


Val 


Leu 


Asd 


Val 


Glv 

ux y 


Asd 


Ala 




290 










295 








Asp 


.file 


Arg 




lyr 


Thr 


Ala 


Php 
rue 


■? n tz, 










J X w 








ax a 


GlU 


Pro 




Lys 


Arg 


Tyr 


Tip 

lie 


Tyr 










J Z3 










Lys 


oxy 


ser 


pro 


Ala 


Tip 

lie 


Php 


OX 11 


Tvr 

iyr 






JlU 










j •* j 


pro 


trilQ 


Arg 


Lys 


Al a 
/U.a 


Asn 






Val 
vai 






3 55 










360 




Asp 


lie 


Leu 


Tip 

lie 




Ser 


Asp 




Thr 














375 






vai 


Leu 




Leu 




\J 1 u 






7\CT1 
/\Oll 












390 








Asp 


r»1 ii 
ulU 


Lys 




Ulll 


T v/e 


As 13 


Pro 


Dm 










405 










Leu 


Trp 


Pro 


rp V« -*~ 

mr 


i»ys 


Trp 


T ve 

jjys 


Leu 




















*4 Z D 


Gill 


val 


Trp 


Thr 


vai 


Asn 


Asp 


Ti * 

lie 


Gin 






A T C 

4 J 5 










A Ad 
*k % V 




Trp 


Ala 


Ala 


Gin 


lie 


Tyr 


Pro 


Giy 


Ti - 

lie 




4 5 U 










4 3 3 






Leu 


tI- 


Arg 


P 1 * 9 

Giy 


Lys 


Met 


inr 


Leu 


inr 












47fl 








Leu 


Ala 


GlU 


7\ "1 — 

Ala 


GlU 


Leu 


r*i n 




Asn 










AQQ 










Gin 


Glu 


P 1 * # 

Giy 


TT « ~ 

HIS 


Tyr 


Tyr 


*7l n 

uin 


V7lU 


t^") ii 

VjlU 








5UU 










eric; 


Gin 


Lys 


Asp 


P 1 

Gin 


Asp 


Asn 




Trp 


inr 






515 










con 




Lys 


He 


Leu 


Lys 


vai 


P 1 * r 

Giy 


Lys 


Tyr 


Aia 




530 










3 J 3 






Asn 


Gly 


val 


Lys 


Leu 


Leu 


Aia 


Gin 


u a i 

vai 


545 










DO u 








Ala 


T A** 

Leu 


vai 


lie 




Arg 


Ti o 

lie 


pro 


Lys 










3D J 










P 1 a « 
GlU 


vai 


Trp 


^i»i 

GlU 


Gin 


Trp 


Trp 


ASp 


Asn 








con 
sou 










585 


Pro 


Asp 


Trp 


Asp 


pne 


vai 


oer 


inr 


Pro 




e o c 
595 










bUU 




Asn 


Leu 


Val 


Giy 


ASp 


pro 


Ti - 

lie 


Pro 


r*1 \/ 
Vjiy 




CIO 

olO 




• 






vl3 






Gly 


Ser 


Cys 


Asn 


Arg 




Cot* 
OC1 


Lys 


r:l 1 1 


625 


















Asp 


Arg 


p i * * 
Giy 


Arg 


Asp 


Lys 


V-a 1 
Val 


Lys 


71 « 

lie 




















Gin 


Axa 


GlU 


Leu 


fill 
bill 


111 a 
Mia 


r^Iic 


A1 a 
nla 


Mpf 








O D v 










665 


Lys 


Ala 


Asn 


Ti- 
ne 


lie 


vai 


ASp 




ri n 
uin 




675 










DOU 




Gly 


Gin 


Pro 


Tnr 


GlU 


bet 




Asn 


Arg 


6 90 










<rqc 

t)73 






Glu 


Met 


He 


Lys 


Lys 


Glu 


»1 - 

Ala 


lie 


Tyr 


705 










710 








Lys 


Gly 


He 


Gly 


/<i 

Giy 


Asn 


Gin 


GlU 


vai 










5 










lie 


Arg 


Gin 


Val 


Leu 


pne 


Leu 


pi !• 
GlU 


Lys 
















745 


His 


Glu 


Lys 


Tyr 


His 


Thr 


Asn 


Val 


Lys 






7S5 










760 




He 


Pr 


Gin 


Leu 


Val 


Ala 


Arg 


Gin 


He 




770 










775 






Gin 


Gin 


Lys 


Gly 


Glu 


Ala 


He 


His 


Gly 


785 










790 








Thr 


Trp 


Gin 


Met 


Asp 


Cys 


Thr 


His 


Leu 



805 
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Val 


Thr 


Gin 


Asp Phe Thr 


Glu 








270 




Glv 


Leu 


Ala 


Lys Lys Arg 


Arq 








285 




Tvr 


Phe 


Ser 


He Pro Leu 


His 






300 






Thr 

X 111 


T .^1 1 
ueu 


Pro 


Ser Val Asn 


Asn 










320 


Lys 


v a. 1 


Leu 


Pro Gin Gly 


Trn 

irp 


"*"*0 

J JU 






335 




1 111 




Arg Gin Val Leu 


Glu 

O X IX 








350 




TI 
lie 


lie 


He 


Gin Tyr Met 


Asp 








365 




Asd 


Leu 


Glu His Asp Lys 


Val 






380 






Ben 
noil 


Leu 


Gly 


Phe Ser Thr 


Pro 




395 






400 


iyr 


ft y*fT 

Arg 


Trp Met Gly Tyr 


Glu 


41 O 
H Iv 






415 




Lys 


lie 


Gin 


Leu Pro Gin 


Lys 








430 




Lys 


Leu 


Val Gly Val Leu 


Asn 








445 




Lys 


Thr 


Lvs 


His Leu Cys 


Arg 






460 






GlU 


ol 
GlU 


Val 


Gin Trp Thr 


ni 11 
GlU 




*t I D 








Arg 


Ti - 

lie 


He 


Leu Ser Gin 


ulu 








495 




Lys 


pi 

GlU 


Leu 


Glu Ala Thr 


Val 






510 




Tyr 


Lys 


He 


His Gin Glu 


ulu 








525 




Lys 


Ti A 

lie 


Lys 


His Thr His 


inr 






540 






vai 


fl rt 

Gin 


Lys 


He Gly Lys 


ulu 












Pne 


His 


Leu 


Pro Val Glu 


Arg 








575 




Tyr 


Trp 


Gin Val Thr Trp 


Ti 0 
lie 








590 




Pro 


Leu 


Val 


Arg Leu Ala 


XrllC 








605 




Thr 


Glu 


Thr 


Phe Tyr Thr 


Asp 






620 






Giy 


Lys 


Ala Gly Tyr Val 


l 111 




f ^ c 






CA ft 


Leu 


/*i «i 
ulu 


Gin 


Thr Thr Asn 


CI rt 

uin 


CCA 






655 




Aia 


Leu 


Thr Asp Ser Gly 


Pro 








670 




Tyr 


vai 


Met 


Gly He Val 


HI a 
nla 






685 




lie 


1/, t 

vai 


Asn 


Gin He He 


ulu 






700 






*t-i 
val 


Ala 


Trp 


Val Pro Ala 


nlS 




713 






T> ft 


Asp 


HIS 


Leu 


Val Ser Gin 


fill/ 

uiy 


730 






-735 




He 


Glu 


Pro 


Ala Gin Glu 


GlU 








750 




fi ti 

ulu 


Leu 


Cys 


His Lys Phe 


Asn 








765 




Val 


Asn 


Thr Cvs Ala Gin 


Tyr 






780 






Gin 


Val 


Asn 


Ala Glu Val 


Gly 




795 






800 


Glu 


Gly 


Lys 


lie He He 


Val 


810 






815 
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ivJLd 


Val 


His 


Val 
820 


Ala 


Ser 


Gly 


Phe 


UflU 


C £i T* 

OCX 


Gl v 
835 




Gin 


Thr 


Ala 


Leu 
840 


Tip 


Pro 


Tip 

lie 


Thx 


His 


Leu 


His 


Thr 


850 










855 








Val 


Lys 


Met 


Val 


Ala 


Trp 


865 










870 






Pi *r 


vol 


c ±. \J 




Asn 


Pro 


Gin 


Ser 








885 








Hi c: 
nis 






Lys 
900 


Asn 


Gin 


He 


Ser 


val 


G1U 


inr 

915 
j j- «j 


Tip 
lie 


V Cl X 


T .pi i 


Met 


Ala 
920 


Arg 


Pi VP 


m v 
uiy 


Tie 


Glv 
vaxy 


Asp 


Met 


Thr 


J J v 










935 




Tl * 

1J.C 


1 Xli 




Glu 
ox u 


Gin 


Glu 


He 


Gin 












950 






Leu 


Lys 


As 11 


Phe 


Arcr 


Val 


Tyr 


Phe 








965 








Lys 


Gly 


Pro 


Gly 


Glu 


Leu 


Leu 


Trp 






980 










Lys 


Val 


Gly 


Thr 


Asp 


He 


Lys 


He 




995 








1000 


He 


Arg 


Asp 


Tyr 


Gly 


Gly 


Arg 


Gin 


1010 








1015 




Glu Gly 


Ala 


Arg 


Glu 


Asn Gly Glu 



025 1030 



He 


Glu 


Ala 


GlU 


Val 


He 


Pro Gin 


825 










830 




Phe 


Leu 


Leu 


Lys 


Leu Ala 


Ser Arg 








845 






Asp 


Asn 


Gly 


Ala 


Asn 


Phe 


Thr Ser 






860 








Trp 


Val 


Gly 


He 


Glu 


Gin 


Thr Phe 




875 








880 


Gin 


Gly 
890 


Val 


Val 


Glu 


Ala 


Met Asn 
895 


Arq 


He 


Arg 


Glu 


Gin 


Ala 


Asn Thr 


905 










910 




Val 


His 


Cvs 


Met 


Asn 
925 


Phe 


Lys Arg 


Pro 


Ser 


Glu 


Arg 
940 


Leu 


He 


Asn Met 


Phe 


Leu 


Gin 


Ala 


Lys 


Asn 


Ser Lys 






955 






960 


Arg 


Glu 


Gly 


Arg 


Asp Gin Leu Trp 


970 










975 


Lys 


Gly 


Asp 


Gly 


Ala 


Val 


He Val 


985 










990 




He 


Pro 


Arg 


Arg 


Lys 


Ala 


Lys He 








1005 






Glu 


Leu 


Asp 


Ser 


Ser 


Ser 


His Leu 






1020 








Val 


Ala 













1 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

<v) FRAGMENT TYPE: internal 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



Met Pro Arg 


Lys 


Thr Ser 
5 


Gly 


Phe 


Phe 


Arg 
10 


1 

Glu Ala Pro 


Gin 


Phe Pro 


His 


Gly 


Pro 


Asp 




20 








25 


Gly 


Asn Cys Ser 


Pro Arg Gly 


Ser 


Ser 


Cys 


35 








40 




Gin 


Glu Asp Gly 


Gin 


Lys Ala 


Glu 


Gly 


Glu 


50 






55 






Phe 


Gly Asn Gly 


Gly Phe Ala 


Ala 


Pro 


Gin 


65 




70 






Gin 




He Val Thr 


Ala 


Tyr He 


Glu 


Glu 


Pro 






85 








90 


Thr Gly Ala 


Asp 


Asp Ser 


He 


Val 


Ala 


Gly 


100 








105 


Gly 


Tyr Thr Pro 


Lys 


He Val 


Gly 


Gly 


He 


115 








120 


Val 




Glu Tyr Lys 


Asp 


Val Lys 


lie 


Lys 


Leu 


130 






135 




lie 




Thr He Met 


Thr Gly Asp 


Thr 


Pro 


Asn 


145 




150 










Leu Thr Ala 


Met 


Gly Met 


Ser 


Leu 


Asn 


Leu 






165 








170 


Pro He Lys 


Val 


Thr Leu 


Lys 


Pro 


Gly 


Lys 


180 








185 




Gin Trp Pro 


Leu 


Ser Lys 


GlU 


Lys 


He 


He 



15 

Ala Ser Gly Ala Asp Thr 
30 

Ser Thr Glu Glu Leu His 
45 

Arg Glu Thr Leu Gin Gly 
60 

Ser Leu Trp Arg Arg Pro 
75 80 
Val Glu Val Leu Leu Asp 
95 

He Glu Leu Gly Pro Asn 
110 

Gly Phe He Asn Thr Lys 
125 

Gly Lys Val lie Lys Gly 
140 

He Phe Gly Arg Asn Leu 
155 160 
Pro He Ala Lys Val Glu 
175 

Asp Gly Pro Lvs Leu Arg 
190 
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195 200 205 

Glu Lys Met Glu Lys Asd Gly Gin Leu Glu Glu Ala Pro Pro Thr Asn 

210 " 215 220 

Pro Tyr Asn Thr Pro Thr Phe Ala He Lys Lys Lys Asp Lys Asn Lys 
225 230 235 240 

Trp Arg Met Leu He Asp Phe Arg Glu Leu Asn Lvs Val Thr Gin Asp 

245 250 255 

Phe Thr Glu Val Gin Leu Gly He Pro His Pro Ala Gly Leu Ala Lys 

260 265 270 

Arg Arg Arg He Thr Val Leu Asd Val Gly Asp Ala Tyr Phe Ser He 

275 280 285 

Pro Leu Asp Glu Glu Phe Arg Gin Tyr Thr Ala Phe Thr Leu Pro Ser 

290 295 300 

Val Asn Asn Ala Glu Pro Gly Lys Arg Tyr He Tyr Lys Val Leu Pro 
305 310 315 320 

Gin Gly Trp Lys Glv Ser Pro Ala He Phe Gin His Thr Met Arg Asn 

325 330 335 

Val Leu Glu Pro Phe Arg Lys Ala Asn Pro Asp Val Thr Leu He Gin 

340 345 350 

Tyr Met Asp Asp He Leu He Ala Ser Asp Arg Thr Asp Leu Glu His 

355 360 365 

Asp Arg Val Val Leu Gin Leu Lvs Glu Leu Leu Asn Ser He Gly Phe 

370 375 * 380 

Ser Thr Pro Glu Glu Lys Phe Gin Lvs Asd Pro Pro Phe Gin Trp Met 
385 390 " 395 400 

Gly Tyr Glu Leu Trp Pro Thr Lvs Trp Lys Leu Gin Lys He Glu Leu 

405 " 410 415 

Pro Gin Arg Glu Thr Trp Thr Val Asn Asp He Gin Lys Leu Val Gly 

420 425 430 

Val Leu Asn Trp Ala Ala Gin He Tyr Pro Gly He Lys Thr Lys His 

435 440 445 

Leu Cys Arg Leu He Arg Gly Lys Met Thr Leu Thr Glu Glu Val Gin 

450 455 460 

Trp Thr Glu Met Ala Glu Ala Glu Tyr Glu Glu Asn Lys He He Leu 
465 470 475 480 

Ser Gin Glu Gin Glu Gly Cys Tyr Tyr Gin Glu Gly Lys Pro Leu Glu 

485 490 495 

Ala Thr Val He Lys Ser Gin Asp Asn Gin Trp Ser Tyr Lys He His 

500 505 510 

Gin Glu Asp Lys He Leu Lys Val Gly Lys Phe Ala Lys He Lys Asn 

515 520 525 

Thr His Thr Asn Gly Val Ara Leu Leu Ala His Val Val Gin Lys He 

530 535 540 

Glv Lys Glu Ala He Val He Trp Gly Gin Val Pro Arg Phe His Leu 
545 550 555 560 

Pro Val Glu Arg Glu He Trp Glu Gin Trp Trp Thr Asp Tyr Trp Gin 

565 570 575 

Val Thr Trp He Pro Glu Trp Asp Phe Val Ser Thr Pro Pro Leu Val 

580 585 590 

Arg Leu Val Phe Asn Leu Val Lvs Glu Pro He Gin Gly Ala Glu Thr 

595 600 605 

Phe Tyr Val Asp Gly Ser Cys Asn Arg Gin Ser Arg Glu Gly Lys Ala 

610 €15 620 

Glv Tyr Val Thr Asp Arg Gly Arg Asp Lys Ala Lys Leu Leu Glu Gin 
625 630 635 640 

Thr Thr Asn Gin Gin Ala Glu Leu Glu Ala Phe Tyr Leu Ala Leu Ala 

645 650 655 

Asd Ser Gly Pro Lys Ala Asn He He Val Asp Ser Gin Tyr Val Met 

* 660 665 670 

Gly He Val Ala Gly Gin Pro Thr Glu Ser Glu Ser Arg Leu Val Asn 

675 680 685 

Gin He He Glu Glu Met He Lys Lys Glu Ala He Tyr Val Ala Trp 

690 695 700 

Val Pro Ala His Lys Gly He Gly Gly Asn Glr Glu Val Asp His Leu 
705 710 7: 720 

Val Ser Gin Gly He Arg Gin Val Leu Phe Le\ Glu Lys He Glu Pro 

725 730 735 

Ala Gin Glu Glu His Glu Lys Tyr His Ser Asn Val Lys Glu Leu Val 

740 745 750 

Phe Lys Phe Gly Leu Pro Arg Leu Val Ala Lys Gin He Val Asp Thr 
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755 760 



Cys 


Asp 


Lys 


Cys 


His 


Gin 


Lys 


Gly 




770 










775 




Ala 


Glu 


Leu 


Gly 


Thr 


Trp 


Gin 


wet 


785 










790 






lie 


lie 


lie 


Val 


Ala 


Val 


His 


vai 










805 








Val 


lie 


Pro 


Gin 


Glu 


Thr 


Gly 


Arg 








820 










Leu 


Ala 


Ser 


Arg 


Trp 


Pro 


He 


Thr 






835 










840 


Asn 


Phe 


Thr 


Ser 


Gin 


Glu 


Val 


Lys 




850 










855 




Glu 


Gin 


Thr 


Phe 


Gly 


Val 


Pro 


Tyr 


865 










870 






Glu 


Ala 


Met 


Asn 


His 


His 


Leu 


Lys 










885 








Gin 


Ala 


Asn 


Ser 


He 


Glu 


Thr 


He 








900 










Asn 


Phe 


Lys 


Arg 


Arg 


Gly 


Gly 


lie 






915 










920 


Leu 


Val 


Asn 


Met 


He 


Thr 


Thr 


Glu 




930 










935 




Lys 


Asn 


ser 


Lys 




Lys 


Asn 


Phe 


945 










950 






Asp 


Gin 


Leu 


Trp 


Lys 


Gly 


Pro 


Gly 








965 








Ala 


Val 


He 


Leu 


Lys 


Val 


Gly 


Thr 








980 










Lys 


Ala 


Lys 


He 


He 


Lys 


Asp 


Tyr 




995 








1000 


Gly 


Ser 


His 


Leu 


Glu 


Asp 


Thr Gly 



1010 1015 
(2) INFORMATION FOR 



765 



uiU 


Mia 


lie 


ui e fll v Gin 


Val Asn 








"7 d n 

/ ou 




Asp 


cys 


Thr 

1 ill. 


Hi €; T.pii Glu 


Gly Lys 


"70 C 

79b 




O U \J 


7\ "1 — 

Ala 


ser 




trie lie vj-l u. 


Ala nln 




Q1 A 
OlO 




O <L J 


oin 


inr 


ft! a 


T ait DVl O T.AU 
rile LCU 




825 






mo 




His 


Leu 


His 


Tnr Asp Asn 


Hi vr TV 1 a 
uiy Ala 












Met 


Val 


Aia 


1 ip lrp ri-Ld 










860 




Asn 


Pro 


Gin 


ser uin uiy 


Val Val 






875 




0 O V 


Thr 


Gin 


He 


Asp Arg ne 


rvx y vjiu 




890 








Val 


Leu 


Met 


Ala vai ills 




905 










Gly 


Asp 


Met 


xnr fro nia 










Gin 


Glu 


ne 


vin xrllc ulU 


Gin Ser 








940 




Arg 


vol 


Tvr 


Tvr Ara Glu 


Glv Ara 




955 




960 


Glu 


Leu 


Leu 


Trp Lys Gly 


Glu Gly 




970 






975 


Glu 


He 


Lys 


Val Val Pro 


Arg Arg 


985 




990 




Gly 


Gly 


Gly 


Lys Glu Leu 


Asp Ser 




1005 




Glu 


Ala 


Arg 


Glu Val Ala 





1020 

ID NO:25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



Ser 


Thr 


Lys 


Lys 


Lys 
5 


Arg 


Leu 


Leu 


Ala 


1 

Asn 


Glu. 


Arg 


Leu 


His 


Arg 


Lys 


Thr 


Gly 






20 










25 


Ala 


Phe 


Pro 


Gin 


Arg 


Glu 


Ala 


Arg 


Gin 






35 










40 




Thr 


Asn 


Gly 


Pro 


Thr 


Asp 


Arg 


Glu 


Leu 




50 








55 




Glu 


Glu 


Pro 


Gly 


Glu 


Glu 


Arg 


Gly 


Arg 


65 








nr\ 








Pro 


Gin 


He 


Thr 


Leu 


Trp 


Gin 


Arg 


Pro 


Gly 


Gin 


Leu 


Cys 


85 
Glu 


Ala 


Leu 


Leu 


Asd 






100 










105 


He 


Glu 


Arg 


He 


Gin 


Leu 


Gin 


Gly 


Leu 






115 










120 




Gly 


He 


Gly 


Gly 


Phe 


He 


Lys 


Val 


Lys 


130 










135 




Gly 


Glu 


He 


Glu 


Gly 


Arg 


Lys 


Val 


Val 


145 










150 









10 15 
Glu Phe Phe Arg Glu Arg Leu 
30 

Leu Cys Ala Glu Gin Asn Arg 
45 

Trp Val Pro Gly Gly Arg Glu 
60 

Gin Ser He Ser Thr Asn Leu 

75 80 
Leu He Pro Val Lvs Val Glu 
90 95 
Thr Gly Ala Asp Asp Thr Val 
110 

Trp Lys Pro Lys Met He Gly 
125 

Gin Phe Asp Asn Val His He 
140 

Thr Val Leu Val Gly Pro Thr 
155 160 
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Pro 


Val 


Asn 


He 


He 


Gly 


Arg 


Asn 










165 








Leu 


Val 


Phe 


Pro 


He 


Ser 


Ser 


He 








1B0 










Pro 


Gly 


Met 


Asp 


Gly 


Pro 


Lys 


Val 




195 










200 


Lvs 


lie 


Lvs 


Ala 


Leu 


Thr 


Glu 


He 


210 










215 




Lys 


lie 


Ser 


Lys 


He 


Glv 


Pro 


Glu 


225 










230 






Ala 


lie 


Lys 


Lys 


Lvs 


Asd 


Ser 


Thr 










245 








«xy 


Glu 


Leu 


Asn 


LVS 


Atq 


Thr 


Gin 








260 










lie 


Pro 


His 


Pro 


Ala 


Glv 


Leu 


LVS 






275 










280 


Hen 


val 
vox 


uxy 


Asp 


Ala 


Tvr 
x yj. 


Phe 


Ser 




A 7 w 










295 






Tyr 


iili 


Ala 


Phe 


Thr 


He 


Pro 


J V J 








310 






Vdl 


Arg 


Tyr 


o"l n 
uin 




Asn 


Val 


Leu 










325 








Ser 


xxc 


Plie 


Gin 


Ser 


Ser 


Met 


Thr 








34 0 










Lys 


Asn 


Pro 


Asp 




X ilX 


He 


Tvr 
x yx 






355 










360 


uiy 


ser 


Asp 


L6U 




Tip 
X 1C 


Asp 


Gin 




1 "7 A 










375 




Arg 


Gin 


HIS 


Leu 


Leu 


Lys 


Trp 


nl \i 


"7 Q tl 










J ^ u 






urXIl 


Lys 


bill 


Pro 


Pro 


XT ilC 












405 








Lys 


Trp 


Th^ 


Val 




Pro 


He 


Gin 








420 










vcix 






He 


Gin 


Lys 


Leu 


lie 






435 










440 


lie 




Pro 


Glv 


He 


Lvs 


He 


LVS 




450 










455 




Thr 


Lys 


Lys 


Leu 


Thr 


Asd 


Val 


Val 


465 










470 






f5l u 

VJX U 


iJCU 


Ala 
nxa 


Glu 


Asn 


Arcr 


Glu 


He 










485 








Tyr 


iyr 






Asp 


Lys 


Glu 


Leu 








500 










Asn 


Cys 


r*l n 


Trn 

irp 


Thr* 

1 iiX 


1 yx 


Gin 


lie 




515 










520 


L»ys 


inr 


uiy 


Lys 


ryr 


Al a 

/-IX CI 




Gin 












535 




Arg 


bin 


Leu 


Al a 


Glu 


Ala 


Val 


Gin 


C Jl C 










550 






Tl o 
lie 


irp 


nl v 
vjxy 


T.VC 
iJjr & 


Thr 


Pro 


Lys 


Phe 








565 








irp 


UlU 


Al a 
Ala 




Tr*D 


Ala 


Glu 


Tvr 








580 










rrp 


olu 


flic 


IXC 


Asn 


Thr 


Pro 


Pro 




595 










600 


ulu 


inr 






xxc 


Pro 


Thr 


Thr 














615 




Ala 


Asn 


Arg 


m 11 


Thr 


Lys 


Thr 


Glv 


oc 

O^D 








630 






pi 

v»iy 


Lys 


uiu 




Tl A 
xxc 


He 

xxc 


Ser 


Leu 










645 








Glu 


Lieu 


Lys 


Ala 


Leu 


Leu 


Leu 


Ala 






660 










Asn 


He 


Val 


Thr 


Asp 


Ser 


Gin 


Tyr 






675 








680 


Pro 


Asp 


His 


Ser 


Glu 


S r 


Glu 


Leu 




690 










695 




lie 


Lys 


Lys 


Glu 


Lys 


He 


Tyr 


Leu 



70S 710 



•84- 



He 


Leu 


Thr 


Gin 


Leu 


Gly Cys 


Thr 




170 








175 




Glu 


Thr 


Val 


Pro 


Val 


Lys Leu 


Lys 


185 










190 




Lys 


Gin 


Trp 


Pro 


Leu 


Ser Ala 


Glu 






205 






Cys 


Gin 


Glu 


Met 


Glu Lys Glu 


Gly 








220 








Asn 


Pro 


Tvr 

j 


Asn 


Thr 


Pro lie 


Phe 






235 








240 


Lvs 


Trp 


Ara 


Lvs 
j 


Leu Val Asp 


Phe 




250 








255 




Asp 


Phe 


Trp 


Glu 


Val 


Gin Leu 


Gly 


265 








270 




LVS 


LVS 


Lvs 


Ser 


Val 


Thr Val 


Leu 










285 






Cvs 


Pro 


Leu 


Asp 


Lys 


Asp Phe 


Ara 








300 








Ser 


He 


Asn 


Asn 


Glu 


Thr Pro 


Glv 






315 








320 


Pro 


Gin 


Glv 
uxy 


Trp Lys Gly Ser 


Pro 




330 








335 




Lvs 


He 


Leu 


Glu 


Pro 


Phe Arg 


Glu 


345 










350 




Gin 


1 y i 


Met 


Asp 


Asp 


Leu Tyr 


Val 










365 






His 


A T"CT 


Lys 


Lys 


Val 


Glu Glu 


Leu 








380 








cim 


1 Hi 


1 in 


Pro 


Asp 


Lys Lys 


nxa 






395 








400 


Mot* 


nl \r 
uiy 


Tyr 


Glu 


Leu 


His Pro 


Hen 




** X v 








415 




- 

Leu 


Drn 
ri w 


VJX U 


Lys 


Glu 


Val Trp 


Thr 


425 










430 




Glv 


Lys 


Leu 


Asn 


Trp Ala Ser 


Gin 










445 






Gin 


Leu 


Cvs 


Lys 


Leu 


He Arg 


Gly 








460 








Pro 


Leu 


Thr 


Pro 


Glu 


Ala Glu 


Leu 






475 








480 


Val 


Ser 


Thr 


Pro Val His Gly 


Val 




490 








495 




He 


Ala 


Glu 


He 


Gin 


Lys Gin 


Glv 


505 










510 




Phe 


Gin 


Glu 


Pro 


His 


Lys Asn 


Leu 










525 








Cor 
OCX 


Al a 
nXa 


His 


Thr 


Asn Asp 


He 








540 








Lys 


Tie 

X X c 


Ala 


Thr 


Glu 


Ser He 


Val 




555 








560 


«xy 


Leu 


Pro 


Val 


Gin 


Lys Glu 


Ser 




570 








575 




Tro 


Gin 


Ala 


Thr 


Trp 


He Pro 


Glu 


585 










590 




Leu 


Val 


Lys 


Leu 


Trp 


Tyr Ser 


Leu 










605 






Asd 


Thr 


Tvr 


Tyr Val Asp Gly 


Ala 








620 








Lvs 


Ala 


Glv 
oxy 


Tyr 


Val 


Thr Asp 


Lys 






635 








640 


nl u 

VJX u 


ften 


Thr 
1. *ii 


Thr 


Asn 


Gin Gin 


Ala 




650 








655 




Leu 


Gin 


Asd 


Ser Asp 


Gin Gin 


Val 


665 










670 




Val 


Leu 


Gly 


lie 


He 


Gin Ser 


Gin 








685 






Val 


Asn 


Gin 


He 


He 


Glu Glu 


Leu 








700 








Ser 


Trp 


Val 


Pro 


Ala 


His Lys 


Gly 






715 








720 
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He 


Gly 


Gly 


Asn 


Glu 


Gin 


Val 


Asp 


Lys 


Leu 


Val 


Ser 


Ala 


Gly 


He 


Arg 








725 










730 










735 




Lys 


Val 


Leu 


Phe 


Leu 


Asp 


Gly 


He 


Asp 


Arg 


Ala 


Gin 


Glu 


Glu 


His 


Glu 






740 










745 










750 






Arg 


Tyr 


His 


Ser 


Asn 


Trp 


Lys 


Ala 


Met 


Ala 


Ser Asp 


Phe 


Asn 


Leu 


Pro 


755 










760 










765 








Pro 


He 


Val 


Ala 


Lys 


Glu 


He 


Val 


Ala 


His 


Cys 


Asp 


Lys 


Cys 


Gin 


Val 




770 










775 










780 










Lys 


Gly 


Glu 


Ala 


Met 


His 


Gly 


Gin 


Val 


Asp 


Cys 


Ser 


Pro 


Gly 


He 


Trp 


785 








790 










795 










800 


Gin 


Val 


Asp 


Cys 


Thr 


His 


Leu 


Glu 


Gly 


Lys 


Val 


He 


He 


Val 


Ala 


Val 










805 










810 










815 




His 


Val 


Ala 


Ser 


Gly 


Tyr 


He 


Glu 


Ala 


Glu 


Val 


He 


Pro 


Ala 


Glu 


Thr 








820 










825 










830 






Gly 


Gin 


Glu 


Thr 


Ala 


Tyr 


Phe 


Leu 


Leu 


Lys 


Leu Ala Gly 


Arg 


Trp 


Pro 




835 










840 










845 








Val 


Lys 


Thr 


He 


His 


Thr 


Asp 


Asn 


Gly 


Pro 


Asn 


Phe 


Thr 


Ser 


Ala 


Ala 




850 










855 










660 










Val 


Lys 


Ala 


Ala 


Cys 


Trp 


Trp 


Ala 


Asp 


He 


Lys 


Gin 


Glu 


Phe Gly 


He 


B65 








870 










875 










880 


Pro 


Tyr 


Asn 


Pro 


Gin 


Ser 


Gin 


Gly 


Val 


Val 


Glu 


Ser 


Leu 


Asn Lys 


Glu 








885 










890 










895 




Leu 


Lys 


Lys 


He 


He 


Gly 


Gin 


Val 


Arg 


Asp 


Gin 


Ala 


Glu 


His 


Leu 


Lys 








900 










905 










910 






Thr 


Ala 


Val 


Gin 


Met 


Ala 


Val 


Phe 


He 


His 


Asn 


Phe 


Lys 


Arg 


Lys 


Gly 






915 










920 










925 








Gly 


He 


Gly 


Gly 


Tyr 


Thr 


Ala 


Gly 


Glu 


Arg 


He 


He Asp 


He 


He 


Ala 


930 










935 










940 










Thr 


Asp 


He 


Gin 


Thr 


Ser 


Glu 


Leu 


Gin 


Lys 


Gin 


He 


Leu 


Lys 


Val 


Gin 


945 








950 










955 










960 


Lys 


Phe 


Arg 


Val 


Tyr 


Tyr 


Arg 


Asp 


Ser 


Arg 


Asp 


Pro 


He 


Trp 


Lys 


Gly 








965 










970 










975 




Pro 


Ala 


Thr 


Leu 


Leu 


Trp 


Lys 


Gly 


Glu 


Glv 


Ala 


Val 


Val 


He 


Gin 


Asp 








980 










985 










990 






Gin 


Gly 


V3JL U 


Leu 


Lys 


val 


Val 


Pro 


Arg 


Arg 


Lys 


Ala 


Lys 


He 


He 


Arcr 




995 








1000 








1005 








Asp 


Tyr 


Gly 


Lys 


Gin 


Met 


Ala Gly 


Asp 


Asp 


Cys 


Val 


Ala 


Ser 


Arg 


Gin 


1010 








1015 








1020 










Asn 


Glu 


Asp 




























025 




































(2) 


INFORMATION FOR SEQ ID 


NO: 26: 














(i) SEQUENCE CHARACTERISTICS: 




















(AJ 


LENGTH: 


1124 amino acids 


















(B) 


TYPE : amino 


acid 






















(C) 


STRANDEDNESS : single. 




















(D) 


TOPOLOGY: linear 




















(ii) MOLECULE 


TYPE: peptide 


















(iii) 


HYPOTHETICAL: NO 




















<iv) ANTISENSE: NO 






















(v) FRAGMENT TYPE: 


; internal 


















<vi) ORIGINAL 


SOURCE : 






















(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO:26: 










Lys 


Glu 


Phe 


Gly 


Lys 


Leu 


Glu 


Gly Gly 


Ala 


Ser 


Cys 


Ser 


Pro 


Ser 


Glu 


1 








5 










10 










15 




Ser 


Asn 


Ala 


Ala 


Ser 


Ser 


Asn 


Ala 


He 


Cys 


Thr 


Ser 


Asn 


Gly Gly 


Glu 








20 










25 










30 






Thr 


He 


Gly 


Phe 


Val 


Asn 


Tyr 


Asn 


Lys 


Val 


Gly 


Thr 


Thr 


Thr 


Thr 


Leu 






35 










40 










45 




He 




Glu 


Lys 


Arg 


Pro 


Glu 


He 


Leu 


He 


Phe 


Val 


Asn 


Gly Tyr 


Pro 


Lys 




50 








55 










60 










Phe 


Leu 


Leu 


Asp 


Thr 


Gly 


Ala 


Asp 


He 


Thr 


lie 


Leu 


Asn 


Arg 


Arg 


Asp 


65 








70 










7S 










80 


Phe 


Gin 


Val 


Lys 


Asn 


Ser 


He 


Glu 


Asn 


Gly 


Arg 


Gin 


Asn 


Met 


He 


Gly 








85 










90 










95 


Glu 


Val 


Gly 


Gly 


Gly 


Lys 


Arg 


Gly 


Thr 


Asn 


Tyr 


He 


Asn 


Val 


His 


Leu 
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100 

lie Arg Asp Glu 
115 

Val Leu Glu Asp 
130 

Met lie Lys Phe 
145 

He Pro Val Val 

He Lys Gin Trp 
180 

He Val Glu Arg 
195 

Asn Asn Pro Trp 
210 

Lys Trp Arg Met 
225 

Lys Gly Ala Glu 

He Lys Lys Gin 
260 

He Pro Leu Asp 
275 

Arg Lys Asn Asn 
290 

Pro Gin Gly Trp 
305 

Asn He He Gin 

Gin Tyr Met Asp 
340 

His Lys Glu Lys 
355 

Phe Glu Thr Pro 
370 

Met Gly Tyr Glu 
385 

Leu Asp He Pro 

Gly Lys He Asn 
420 

Ala Leu Thr Asn 
435 

Gin Trp Thr Lys 
450 

He Glu Glu Gin 
465 

Tyr Ala Lys Leu 

Tyr Gin Lys Asp 
500 

Gin Lys Lys Lys 
515 

Tyr Lys He Arg 
530 

Tyr Glu He Pro 
545 

Ser Pro Tyr Leu 

Ala Leu Asn He 
580 

Pro Gly Ala Glu 
595 

Ala Ala Lys Ala 
610 

Asp Leu Glu Gly 
625 

Ala Leu Lys Ala 
Tyr Val He Asn 



Asn 


Tyr 


Lys 


Thr 








120 


Asn 


Ser 


Leu 


He 






135 




Asn 


He 


Atq 


Leu 




150 






Lvs 


Val 


Lvs 


Met 


165 








Pro 


Leu 


Thr 


Asn 


Leu 


Glu 


Lys 


Glu 








200 


Asn 


Thr 


Pro 


Val 






215 




Leu 


He 


Asp 


Phe 




230 






Val 


Gin 


Leu 


Glv 


245 








Val 


Thr 


Val 


Leu 


Pro 


Asp 


Tyr 


Ala 








280 




vj-i- y 


*r 1 W 


w«l y 






OQC 




lie 


Leu 


Ser 


Pro 




310 






tr i \j 


Phe 


He 




7^ 5 








Asp 


Tip* 

lie 


iyr 


11C 


Val 


Glu 


Glu 


Leu 








360 


UlU 


nay 


T A/C 

l>y o 








J / 3 




Leu 


IllS 


Pro 


Leu 




*a q n 






ulU 


m n 

uXu 


Pro 


Thr 

L 111 


*k US 








Tm 
1 £ P 


Ala 


Ser 


Gin 


Met 


Met 


Arg 


Gly 








440 


Glu 


Ala 




Leu 






455 




Val 


Gin 


Leu 


Glv 




470 






Ser 


Leu 


Val 


Glv 


485 








Pro 


Glu 


Lys 


He 


Ala 


Glu 


Asn 


Thr 








520 




Glu 


Ser 


He 






5 3 5 




Thr 


Ser 


Ara 


Glu 




5 50 






Lys 


Al a 

r\A- d 


Pro 


Pro 


rrc 
DD3 








T \/c 
J-iyo 


T\ ret 


Ala 


Leu 


J. XIX 


i rp 


Tvr 
iyt 


He 








600 


Ala 


Tyr 


Trp 


Thr 






615 




Ser 


Asn 


Gin 


Lys 




630 






Gly 


Ser 


Glu 


Glu 


645 








He 


He 


Leu 


Gin 



•86- 



105 








Gin 


Cvs 


lie 


Phe 


Gin 


Pro 


Leu 


Leu 








140 


Val 


Met 


Ala 


Gin 






155 




Lvs 


Asp 


Pro 


Asn 




170 






Glu 


Lvs 


lie 


Glu 


185 








Glv 


Lvs 


Val 


Lvs 

uy o 


Phe 


Ala 


He 


Lvs 








220 


Arg 


Glu 


Leu 


Asn 






235 




Leu 


Pro 


His 


Pro 




250 






Asp 


He Gly 




?fi5 

£ O 3 








Prn 


Tyr 


Thr 




Arg 


Arg 


Phe 


vdl 








300 


Leu 


He 


Tyr 


Gin 






315 




XII 


Asn 


Pro 


oin 




330 






uiy 


Ser 


Asn 


LSU 










Arg 


Lys 


Leu 


L&U 


vjin 


Glu 


Glu 


Pro 








360 


Thr 


Trp 


Thr 


He 






395 




Leu 


Asn 


Glu 


Leu 




410 






Aid 


He 


Pro 


_ 

ASp 


*t ^ ^ 










Gin 


Asn 




Glu 


Val 


Gin 


Lys 








460 


Tyr 


Tyr Asp 


Pro 






475 




Pro 


His 


Gin 


11C 




490 






Lieu 


Trp Tyr 


\j i. y 


enc 








v-ys 


Asp 


He 


Aid 


11C 


Arg 


lie 


vji y 








540 


Ala 


Trp Glu 


Ser 






555 




Pro 


Glu 


Val 


ii 

ulu 




570 






C 


Met 


He 


Lys 


CDC 

3 0 3 








Asp 


Gly Gly 


Arg 


Asp 


Thr Gly 


Lys 








620 


Ala 


Glu 


lie 


Gin 






635 




Met 


Asn 


lie 


lie 




650 






Gin 


Pro Asp 


Met 



110 

Gly Asn Val Cys 
125 

Gly Arg Asp Asn 

He Ser Asd Lys 
160 

Lys Gly Pro Gin 
175 

Ala Leu Thr Glu 
190 

Arg Ala Asd Ser 
205 

Lys Lys Ser Gly 

Lys Leu Thr Glu 
240 

Ala Gly Leu Gin 
255 

Ala Tvr Phe Thr 
270 

Phe Thr Leu Pro 
285 

Trp Cys Ser Leu 

Ser Thr Leu Asp 
320 

Leu Asp He Tyr 
335 

Ser Lys Lys Glu 
350 

Leu Trp Trp Gly 
365 

Pro Tyr Thr Trp 

Gin Gin Lys Gin 
400 

Gin Lys Leu Ala 
415 

Leu Ser He Lys 
430 

Asn Ser Thr Arg 
445 

Ala Lys Lys Ala 

Ser Lys Glu Leu 
480 

Ser Tyr Gin Val 
495 

Lys Met Ser Arg 
510 

Leu Arg Ala Cys 
525 

Lys Glu Pro Arg 

Asn Leu He Asn 
560 

Tyr lie His Ala 
575 

Asp Ala Pro He 
590 

Lys Leu Gly Lys 
605 

Trp Arg Val Met 

Ala Leu Leu Leu 
640 

Thr Asp Ser Gin 
655 

Met Glu Gly lie 
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660 665 670 

Trp Gin Glu Val Leu Glu Glu' Leu Glu Lys Lys Thr Ala lie Phe lie 

675 680 685 

Asp Trp Val Pro Gly His Lys Gly He Pro Gly Asn Glu Glu Val Asp 

690 695 " 700 

Lys Leu Cys Gin Thr Met Met He He Glu Gly Asp Gly He Leu Asp 
705 710 715 720 

Lys Arg Ser Glu Asp Ala Gly Tyr Asp Leu Leu Ala Ala Lys Glu He 

725 730 735 

His Leu Leu Pro Gly Glu Val Lys Val He Pro Thr Gly Val Lys Leu 

740 745 750 

Met Leu Pro Lys Gly Tyr Trp Gly Leu He He Gly Lys Ser Ser He 

755 760 765 

Gly Ser Lys Glv Leu Asp Val Leu Gly Gly Val He Asp Glu Gly Tyr 

770 775 780 

Arg Gly Glu He Gly Val lie Met He Asn Val Ser Arg Lys Ser He 
785 790 795 800 

Thr Leu Met Glu Arg Gin Lys He Ala Gin Leu He He Leu Pro Cys 

805 810 815 

Lys His Glu Val Leu Glu Gin Gly Lys Val Val Met Asp Ser Glu Arg 

820 825 830 

Gly Asp Asn Gly Tyr Gly Ser Thr Gly Val Phe Ser Ser Trp Val Asp 

835 840 845 

Arg He Glu Glu Ala Glu He Asn His Glu Lys Phe His Ser Asp Pro 

850 855 860 

Gin Tyr Leu Arg Thr Glu Phe Asn Leu Pro Lys Met Val Ala Glu Glu 
865 870 875 880 

He Arg Arg Lys Cys Pro Val Cys Arg He He Gly Glu Gin Val Gly 

885 890 895 

Gly Gin Leu Lys lie Gly Pro Gly He Trp Gin Met Asp Cys Thr His 

900 905 910 

Phe Asp Gly Lys He He Leu Val Gly He His Val Glu Ser Gly Tyr 

915 920 925 

He Trp Ala Gin He He Ser Gin Glu Thr Ala Asp Cys Thr Val Lys 

930 935 940 

Ala Val Leu Gin Leu Leu Ser Ala His Asn Val Thr Glu Leu Gin Thr 
945 950 955 960 

Asp Asn Gly Pro Asn Phe Lys Asn Gin Lys Met Glu Gly Val Leu Asn 

965 970 975 

Tyr Met Gly Val Lys His Lys Phe Gly He Pro Gly Asn Pro Gin Ser 

980 985 990 

Gin Ala Leu Val Glu Asn Val Asn His Thr Leu Lys Val Trp He Gin 

995 1000 1005 

Lys Phe Leu Pro Glu Thr Thr Ser Leu Asp Asn Ala Leu Ser Leu Ala 

1010 1015 1020 

Val His Ser Leu Asn Phe Lys Arg Arg Gly Arg He Gly Gly Met Ala 
025 1030 1035 1040 

Pro Tyr Glu Leu Leu Ala Gin Gin Glu Ser Leu Arg He Gin Asp Tyr 

1045 1050 1055 

Phe Ser Ala He Pro Gin Lys Leu Gin Ala Gin Trp He Tyr Tyr Lys 

1060 1065 1070 

Asp Gin Lys Asp Lys Lys Trp Lys Gly Pro Met Arg Val Glu Tyr Trp 

1075 1080 1085 

Gly Gin Gly Ser Val Leu Leu Lys Asp Glu Glu Lys Gly Tyr Phe Leu 

1090 1095 1100 

He Pro Arg Arg His He Arg Arg Val Pro Glu Pro Cys Ala Leu Pro 
105 1110 1115. 1120 

Glu Gly Asp Glu 

1 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 ammo acids 

(B) TYPE: amino acid 

<C) STRANDEDNES5 : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 27: 



1*1 c w 


ux u 




Val 


He 


Lys 


Val 


He 


Ser 


Ser Ala 


v-y » 


Lys Thr Tyr 


Cys 










5 










10 




15 




ul y 


_ 

i*ys 


Thr* 

J. ilX 


Car 

OCX 


Pro 


Ser 


Lvs 


Lvs 


Glu 


He Gly 


Ala 


Mf*t Leu Ser 


Leu 








20 










25 






30 




Leu 




x»y o 


CZ 1 ii 




Leu 


Leu 


Met 


Ser 


Pro Ser 


Asp 


Lpn Tvr Ser 


Pro 






35 










40 








45 




uiy 


ser 


irp 


ASp 




Tie 
x x c 


Thr 


Ala 


Ala 


Leu Ser 


Gin 


tra Ala Met 


He 


50 










55 








•60 






L6U 


Giy 


Lys 


ser 


uiy 




T .on 


x>y 0 


Thr 

11 IX 


x x vjx y 


T .01 1 
lieu 


Va 1 L,eu Gl v 
v ct x ijcu wxy 


Ala 

Aid 


O D 










70, 








7*5 






0 V 


Leu 


Lys 


Ala 


Ala 


Arg 


GIU 


olU 


uin 


vai 






Gin Aia juys 


irne 








b b 


















Trp 


Leu 


Gly 


Leu 


Gly 


Giy 


GIV 


Arg 


vai 


Ser Pro 


Pro 


Giy pro giu 


Cys 






10U 
















llU 




lie 


Glu 


Lys 


Pro 


Ala 


Thr 


Glu 


Arg 


Arg 


He Asp 


Lys 


Gly Glu Glu 


val 






115 










120 








125 




Gly 


Glu 


Thr 


Thr 


Val 


Gin 


Arg 


Asp 


Ala 


Lys Met 


Ala 


Pro Glu Glu 


Thr 


130 










135 








140 






Ala 


Thr 


Pro 


Lys 


Thr 


Val 


Gly 


Thr 


Ser 


Cys Tyr 


His 


Cys Gly Thr 


Ala 


145 










150 








155 






160 


lie 


Gly 


Cys 


Asn 


Cys 


Ala 


Thr 


Ala 


Ser 


Ala Pro 


Pro 


Pro Pro Tyr 


Val 






165 










170 




175 




Gly 


Ser 


Gly 


Leu 


Tyr 


Pro 


Ser 


Leu 


Ala 


Gly Val 


Gly 


Glu Gin Gin 


Gly 








180 










185 






190 




Gin 


Gly 


Gly 


Asp 


Thr 


Pro 


Pro 


Gly 


Ala 


Glu Gin 


Ser 


Arg Ala Glu 


Pro 




195 










200 








o a e 
205 




Gly 


His 


Ala 


Gly 


Gin 


Ala 


Pro 


Gly 


Pro 


Ala Leu 


Thr 


Asp Trp Ala 


Arg 




210 










215 








220 






Val 


Arg 


Glu 


Glu 


Leu 


Ala 


Ser 


Thr 


Gly 


Pro Pro 


Val 


Val Ala Met 


Pro 


225 








230 














24U 


Val 


Val 


lie 


Lys 


Thr 


Glu 


Gly 


Pro 


Ala 


Trp Thr 


Pro 


Leu Glu Pro 


Lys 








245 










*5 C A 

2b0 




0 c c 




Leu 


lie 


Thr 


Arg 


Leu 


Ala 


Asp 


Thr 


vai 


Arg Thr 


Lys 


Gly Leu Arg 


ser 








260 










-) c c 






2 t\J 




Pro 


lie 


Thr 


Met 


Ala 


Glu 


ir_ 1 

Val 


Glu 


Ala 


Leu Met 


Ser 


Ser Pro Leu 


Leu 






275 










*5 O A 

280 








2ob 




Pro 


His 


Asp 


Val 


Thr 


Asn 


Leu 


Met 


Arg 


Val He 


Leu 


Gly Pro Ala 


Pro 




290 








"1 Q C 








*> A A 

300 






Tyr 


Ala 


Leu 


Trp 


Met 


Asp 


Ala 


Trp 


Gly 


Val Gin 


Leu 


Gin Tnr Val 


lie 


305 








310 








■31c 
3lb 






Tin 
320 


Ala 


Ala 


Ala 


Thr 


Arg 


Asp 


Pro 


Arg 


xllS 


Pro Ala 


Asn 


Giy Gin Giy 


Arg 










32b 










*4 n 




7 ^ 

J w 9 




Gly 


Glu 


Arg 


Thr 


Asn 


Leu 


Asn 


Arg 


Leu 


Lys Gly 


Leu 


Ala ASp biy 


Met 




340 
















3 bU 




Val 


Gly 


Asn 


Pro 


Gin 


Gly 


Gin 


Aia 


Ala 


Leu Leu 


Arg 


riO Giy GIU 


Leu 




355 










JDU 












Val 


Ala 


He 


Thr 


Ala 


Ser 


Ala 


Leu 


Gin 


Ala Phe 


Arg 


Glu vai Ala 


Arg 




370 










375 








IDA 

380 






Leu 


Ala 


Glu 


Pro 


Ala 


Gly 


Pro 


Trp 


Ala 


Asp He 


Met 


Gin Gly Pro 


Ser 


385 










390 








39b 






4 rt 
4U0 


Glu 


Ser 


Phe 


Val 


Asp 


Phe 


Aia 


Asn 


Arg 


Leu He 


Lys 


Kxa Vdl GIU 


Giy 










405 










410 




415 




Ser 


Asp 


Leu 


Pro 


Pro 


Ser 


Ala 


Arg 


Ala 


Pro Val 


lie 


He Asp Cys 


Phe 








420 










425 






430 




Arg 


Gin 


Lys 


Ser 


Gin 


Pro 


Asp 


lie 


Gin 


Gin Leu 


lie 


Arg Thr Ala 


Pro 




435 










440 








445 




Ser 


Thr 


Leu 


Thr 


Thr 


Pro 


Gly 


Glu 


He 


He Lys 


Tyr 


Val Leu Asp 


Arg 




450 










455 








460 






Gin 


Lys 


Thr 


Ala 


Pro 


Leu 


Thr 


Asp 


Gin 


Gly He 


Ala 


Ala Ala Met 


Ser 


465 








470 








475 






480 


Ser 


Ala 


He 


Gin 


Pro 


Leu 


He 


Met 


Ala 


Val Val 


Asn 


Arg Glu Arg 


Asp 










485 










490 




495 




Gly 


Gin 


Thr 


Gly 


Ser 


Gly 


Gly 


Arg 


Ala 


Arg Gly 


Leu 


Cys Tyr Thr 


Cys 






500 










505 






510 
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Gly Ser 


Pro 


Gly 


His 


Ty r 


Gin 


Ala 


Gin 


Cys 


Pro 


Lys 


Lys Arg Lys 


Ser 




515 










520 










525 




Gly Asn 


Ser 


Arg 


Glu 


Arg 


Cys 


Gin 


Leu 


Cys 


Asn Gly 


Met Gly His 


Asn 


530 










535 










540 






Ala Lys 


Gin 


Cys 


Arg 


Lys 


Arg 


Asp 


Gly Asn 


Gin Gly 


Gin Arg Pro 


Gly 


545 








550 










555 






560 


Lys Gly 


Leu 


Ser 


Ser 


Gly 


Pro 


Trp 


Pro Gly 


Pro 


Glu 


Pro Pro Ala 


Val 






565 










570 






575 




Ser Leu 


Ala 


Met 


Thr 


Met 


Glu 


His 


Lys 


Asp 


Arg 


Pro 


Leu Val Arg 


Val 






580 










585 








590 




lie Leu 


Thr 


Asn 


Thr 


Gly 


Ser 


His 


Pro 


Val 


Lys 


Gin 


Arg Ser Val 


Tyr 




595 










600 










605 




He Thr 


Ala 


Leu 


Leu 


Asp 


Ser 


Gly 


Ala Asp 


lie 


Thr 


He lie Ser 


Glu 


610 










615 










620 






Glu Asp 


Trp 


Pro 


Thr 


Asp 


Trp 


Pro 


Val 


Met 


Glu 


Ala 


Ala Asn Pro 


Gin 


625 








630 










635 






640 


He His 


Glv 


lie 


Glv 


Glv 


Glv 


He 


Pro 


Met 


Arg 


Lys 


Ser Arg Asp 


Met 








645 










650 






655 




He Glu 


Leu 


Gly 


Val 


He 


Asn 


Arg 


Asp Gly 


Ser 


Leu 


Glu Arg Pro 


Leu 






660 










665 








670 




Leu Leu 


Phe 


Pro 


Ala 


Val 


Ala 


Met 


Val 


Arg 


Gly Ser 


He Leu Gly 


Arg 




675 










680 










685 




Asp Cys 


Leu 


Gin 


Gly 


Leu 


Gly 


Leu 


Arg 


Leu 


Thr 


Asn 


Leu 




690 










695 










700 








(2) 


INFORMATION FOR SEQ ID 


NO:28: 








(i) SEQUENCE CHARACTERISTICS: 














(A) 


LENGTH: 


1199 amino acids 












(B) 


TYPE: amino 


acid 
















(C) 


STRANDEDNESS : single 














(D) 


TOPOLOGY: linear 














(ii) MOLECULE 


TYPE : peptide 












(iii) 


HYPOTHETICAL: NO 














(iv) ANT I SENSE : NO 
















(v) FRAGMENT TYPE: 


internal 












(vi) ORIGINAL 


SOURCE : 
















(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID 


NO:28: 






Gly Gly 


Gin 


Gly 


Gin 


Asp 


Pro 


Pro 


Pro 


Glu 


Pro 


Arg 


He Thr Leu 


Lys 


1 






5 










10 






15 




Val Gly 


Gly 


Gin 


Pro 


Val 


Thr 


Phe 


Leu 


Val 


Asp 


Thr 


Gly Ala Gin 


His 


20 










25 








30 




Ser Val 


Leu 


Thr 


Gin 


Asn 


Pro 


Gly 


Pro 


Leu 


Ser 


Asp 


Lys Ser Ala 


Trp 




35 










40 










45 




Val Gin 


Gly 


Ala 


Thr 


Gly 


Gly 


Lys 


Arg 


Tyr 


Arg 


Trp 


Thr Thr Asp 


Arg 


50 








55 










60 






Lys Val 


His 


Leu 


Ala 


Thr 


Gly 


Lys 


Val 


Thr 


His 


Ser 


Phe Leu His 


Val 


65 








70 










75 






B0 


Pro Asp 


Cys 


Pro 


Tyr 


Pro 


Leu 


Leu 


Gly 


Arg 


Asp 


Leu 


Leu Thr Lys 


Leu 




85 










90 






95 




Lys Ala 


Gin 


He 


His 


Phe 


Glu 


Gly 


Ser 


Gly 


Ala 


Gin 


Val Met Gly 


Pro 




100 










105 








110 




Met Gly 


Gin 


Pro 


Leu 


Gin 


Val 


Leu 


Thr 


Leu 


Asn 


He 


Glu Asp Glu 


His 


115 










12 0 










125 




Arg Leu 


His 


Glu 


Thr 


Ser 


Lvs 


Glu 


Pro 


Asp 


Val 


Ser 


Leu Gly Ser 


Thr 


130 










135 










140 






Trp Leu 


Ser 


Asp 


Phe 


Pro 


Gin 


Ala 


Trp 


Ala 


Glu 


Thr 


Gly Gly Met 


Gly 


145 






150 










155 






160 


Leu Ala 


val 


Arg 


Gin 


Ala 


Pro 


Leu 


-He 


He 


Pro 


Leu 


Lys Ala Thr 


Ser 






165 










170 






1 IO 




Thr Pro 


val 


Ser 


He 


Lys 


Gin 


Tyr 


Pro 


Met 


S-r 


Gin 


Glu Ala Arg 


Leu 






180 








185 








190 




Gly He 


Lys 


Pro 


His 


He 


Gin 


Arg 


Leu 


Leu 


a: p 


Gin 


Gly lie Leu 


Val 


195 










200 










205 




Pro Cys 


Gin 


Ser 


Pro 


Trp 


Asn 


Thr 


Pro 


Leu 


Leu 


Pro 


Val Lys Lys 


Pro 


210 










215 










220 






Gly Thr 


Asn 


Asp 


Tyr 


Arg 


Pro 


Val 


Gin 


Asp 


Leu 


Arg 


Glu Val Asn 


Lys 
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225 230 235 240 

Ara Val Glu Asp He His Pro Thr Val Pro Asn Pro Tyr Asn Leu Leu 

3 245 250 255 

Ser Gly Leu Pro Pro Ser His Gin Trp Tyr Thr Val Leu Asp Leu Lys 

260 265 270 

Asp Ala Phe Phe Cys Leu Arg Leu His Pro Thr Ser Gin Pro Leu Phe 

275 280 285 

Ala Phe Glu Trp Arg Asd Pro Glu Met Gly He Ser Gly Gin Leu Thr 

290 " 295 300 

Trp Thr Arg Leu Pro Gin Glv Phe Lys Asn Ser Pro Thr Leu Phe Asp 
305 310 * 315 320 

Glu Ala Leu His Arg Asp Leu Ala Asp Phe Arg He Gin His Pro Asp 

325 330 335 

Leu He Leu Leu Gin Tyr Val Asd Asd Leu Leu Leu Ala Ala Thr Ser 

340 345 350 

Glu Leu Asp Cys Gin Gin Gly Thr Arg Ala Leu Leu Gin Thr Leu Gly 

355 360 365 

Asn Leu Gly Tyr Arg Ala Ser Ala Lys Lys Ala Gin He Cys Gin Lys 

370 375 380 

Gin Val Lys -Tyr Leu Gly Tyr Leu Leu Lys Glu Gly Gin Arg Trp Leu 
3B5 390 395 400 

Thr Glu Ala Arg Lys Glu Thr Val Met Gly Gin Pro Thr Pro Lys Thr 

405 410 415 

Pro Arg Gin Leu Arg Glu Phe Leu Gly Thr Ala Gly Phe Cys Arg Leu 

420 425 430 

Trp He Pro Gly Phe Ala Glu Met Ala Ala Pro Leu Tyr Pro Leu Thr 

435 440 445 

Lys Thr Gly Thr Leu Phe Asn Trp Gly Pro Asp Gin Gin Lys Ala Tyr 

450 455 460 

Gin Glu He Lys Gin Ala Leu Leu Thr Ala Pro Ala Leu Gly Leu Pro 
465 470 475 480 

Asp Leu Thr Lys Pro Phe Glu Leu Phe Val Asp Glu Lys Gin Gly Tyr 

485 490 495 

Ala Lys Gly Val Leu Thr Gin Lys Leu Gly Pro Trp Arg Arg Pro Val 

500 505 510 

Ala Tyr Leu Ser Lys Lys Leu Asd Pro Val Ala Ala Gly Trp Pro Pro 

515 520 525 

Cys Leu Arg Met Val Ala Ala He Ala Val Leu Thr Lys Asp Ala Gly 

530 535 540 

Lys Leu Thr Met Gly Gin Pro Leu Val He Leu Ala Pro His Ala Val 
545 550 555 560 

Glu Ala Leu Val Lys Gin Pro Pro Asp Arg Trp Leu Ser Asn Ala Arg 

565 570 575 

Met Thr His Tyr Gin Ala Leu Leu Leu Asp Thr Asp Arg Val Gin Phe 

580 585 590 

Gly Pro Val Val Ala Leu Asn Pro Ala Thr Leu Leu Pro Leu Pro Glu 

595 600 605 

Glu Gly Leu Gin His Asn Cvs Leu Asp He Leu Ala Glu Ala His Gly 

610 615 620 

Thr Arg Pro Asp Leu Thr Asd Gin Pro Leu Pro Asp Ala Asp His Thr 
625 630 " 635 640 

Trp Tvr Thr Asp Gly Ser Ser Leu Leu Gin Glu Gly Gin Arg Lys Ala 

645 650 655 

Glv Ala Ala Val Thr Thr Glu Thr Glu Val He Trp Ala Lys Ala Leu 

660 665 670 

Pro Ala Gly Thr Ser Ala Gin Arg Ala Glu Leu He Ala Leu Thr Gin 

675 680 685 

Ala Leu Lys Met Ala Glu Gly Lys Lys Leu Asn Val Tyr Thr Asp Ser 

690 695 700 

Ara Tvr Ala Phe Ala Thr Ala His lie His Gly Glu He Tyr Arg Arg 
705 ^10 715 720 

Ara Glv Leu Leu Thr S r Glu Gly Lys Glu He Lys Asn Lys Asp Glu 

* 1 725 730 735 

He Leu Ala Leu Leu Lys Ala Leu Phe Leu Pro Lys Arg Leu Ser He 

740 745 750 

He His Cys Pro Gly His Gin Lvs Gly His Ser Ala Glu Ala Arg Gly 

755 760 765 

Asn Arg Met Ala Asp Gin Ala Ala Arg Lys Ala Ala He Thr Glu Thr 

770 775 780 

Pro Asp Thr Ser Thr Leu Leu lie Glu Asn Ser Ser Pro Tyr Thr Ser 
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785 790 795 800 

Glu His Phe His Tyr Thr Val Thr Asp lie Lys Asp Leu Thr Lys Leu 

805 810 815 

Gly Ala He Tyr Asp Lys Thr Lys Lys Tyr Trp Val Tyr Gin Gly Lys 

820 825 830 

Pro Val Met Pro Asp Gin Phe Thr Phe Glu Leu Leu Asp Phe Leu His 

835 840 845 

Gin Leu Thr His Leu Ser Phe Ser Lys Met Lys Ala Leu Leu Glu Arg 

850 855 860 

Ser His Ser Pro Tyr Tyr Met Leu Asn Arg Asd Arg Thr Leu Lys Asn 
865 870 875 880 

He Thr Glu Thr Cys Lys Ala Cys Ala Gin Val Asn Ala Ser Lys Ser 

885 890 895 

Ala Val Lys Gin Gly Thr Arg Val Arg Gly His Arg Pro Gly Thr His 

900 905 910 

Trp Glu He Asp Phe Thr Glu lie Lys Pro Gly Leu Tyr Gly Tyr Lys 

915 920 925 

Tyr Leu Leu Val Phe lie Asp Thr Phe Ser Gly Trp He Glu Ala Phe 

930 935 940 

Pro Thr Lys Lys Glu Thr Ala Lys Val Val Thr Lys Lys Leu Leu Glu 
945 950 955 960 

Glu lie Phe Pro Arg Phe Gly Met Pro Gin Val Leu Gly Thr Asd Asn 

965 970 975 

Gly Pro Ala Phe Val Ser Lys Val Ser Gin Thr Val Ala Asp Leu Leu 

980 985 990 

Gly lie Asp Trp Lys Leu His Cys Ala Tyr Arg Pro Gin Ser Ser Gly 

995 1000 1005 

Gin Val Glu Arg Met Asn Arg Thr He Lys Glu Thr Leu Thr Lys Leu 

1010 1015 1020 

Thr Leu Ala Thr Gly Ser Arg Asp Trp Val Leu Leu Leu Pro Leu Ala 
025 1030 1035 1040 

Leu Tyr Arg Ala Arg Asn Thr Pro Gly Pro His Gly Leu Thr Pro Tyr 

1045 1050 1055 

Glu He Leu Tyr Gly Ala Pro Pro Pro Leu Val Asn Phe Pro Asp Pro 

1060 1065 1070 

Asp Met Thr Arg Val Thr Asn Ser Pro Ser Leu Gin Ala His Leu Gin 

1075 1080 1085 

Ala Leu Tyr Leu Val Gin His Glu Val Trp Arg Pro Leu Ala Ala Ala 

1090 1095 1100 

Tyr Gin Glu Gin Leu Asp Arg Pro Val Val Pro His Pro Tyr Arg Val 
105 1110 1115 1120 

Gly Asp Thr Val Trp Val Arg Arg His Gin Thr Lys Asn Leu Glu Pro 

1125 1130 1135 

Arg Trp Lys Gly Pro Tyr Thr Val Leu Leu Thr Thr Pro Thr Ala Leu 

1140 1145 1150 

Lys Val Asp Gly He Ala Ala Trp He His Ala Ala His Val Lys Ala 

1155 1160 1165 

Ala Asp Pro Gly Gly Gly Pro Ser Ser Arg Leu Thr Trp Arg Val Gin 

1170 1175 1180 

Arg" Ser Gin Asn Pro Leu Lys He Arg Leu Thr Arg Glu Ala Pro 
185 1190 1195 1 

(2) INFORMATION FOR SEQ ID NO: 29; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 12 04 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Thr Leu Asp Asp Gin Gly Gly Gin Glv Gin Glu Pro Pro Pro Glu Pro 
15 10 15 
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Arg lie Thr Leu Lys Val Gly Gly Gin Pro Val Thr Phe Leu Val Asp 

20 25 30 

Thr Gly Ala Gin His Ser Val Leu Thr Gin Asn Pro Gly Pro Leu Ser 

35 40 45 

Asp Lys Ser Ala Trp Val Gin Glv Ala Thr Glv Gly Lvs Arg Tyr Arg 

50 55 60 

Trp Thr Thr Asp Arg Arg Val His Leu Ala Thr Gly Lys Val Thr His 
65 70 75 " 80 

Ser Phe Leu His Val Pro Asp Cys Pro Tvr Pro Leu Leu Gly Arg His 

85 90 95 

Leu Leu Thr Lys Leu Lys Ala Gin lie His Phe Glu Gly Ser Gly Ala 

100 105 110 

Gin Val Val Gly Pro Met Gly Gin Pro Leu Gin Val Leu Thr Leu Asn 

115 120 125 

lie Glu Asp Glu Tyr Arg Leu His Glu Thr Ser Lys Gly Pro Asp Val 

130 135 140 

Pro Leu Gly Ser Thr Trp Leu Ser Asp Phe Pro Gin Ala Trp Ala Glu 
145 150 155 160 

Thr Gly Gly Met Gly Leu Ala Phe Arg Gin Ala Pro Leu He He Ser 

165 170 175 

Leu Lys Ala Thr Ser Thr Pro Val Ser He Lys Gin Tyr Pro Met Ser 

180 185 190 

Gin Glu Ala Arg Leu Gly He Lys Pro His He Gin Arg Leu Leu Asp 

195 200 205 

Gin Gly He Leu Val Pro Cys Gin Ser Pro Trp Asn Thr Pro Leu Leu 

210 215 220 

Pro Val Lys Lys Pro Gly Thr Asn Asp Tyr Arg Pro Val Gin Asp Leu 
225 230 235 240 

Arg Glu Val Asn Lys Arg Val Glu Asp He His Pro Thr Val Pro Asn 

245 250 255 

Pro Tyr Asn Leu Leu Ser Gly Leu Pro Pro Ser His Gin Trp Tyr Thr 

260 265 270 

Val Leu Asp Leu Lys Asp Ala Phe Phe Cys Leu Arg Leu His Pro Thr 

275 280 285 

Ser Gin Ser Leu Phe Ala Phe Glu Trp Lys Asp Pro Glu Met Gly He 

290 295 300 

Ser Gly Gin Leu Thr Trp Thr Arg Leu Pro Gin Gly Phe Lys Asn Ser 
305 310 315 320 

Pro Thr Leu Phe Asp Glu Ala Leu His Arg Asp Leu Ala Asp Phe Arg 

325 330 335 

He Gin His Pro Asp Leu lie Leu Leu Gin Tyr Val Asp Asp Leu Leu 

340 345 350 

Leu Ala Ala Thr Ser Glu Leu Asp Cys Gin Gin Gly Thr Arg Ala Leu 

355 360 365 

Leu Gin Thr Leu Gly Asp Leu Gly Tyr Arg Ala Ser Ala Lys Lys Ala 

370 375 380 

Gin lie Cys Gin Lys Gin Val Lys Tyr Leu Gly Tyr Leu Leu Lys Glu 
385 390 395 400 

Gly Gin Arg Trp Leu Thr Glu Ala Arg Lys Glu Thr Val Met Gly Gin 

405 410 415 

Pro Thr Pro Lys Thr Pro Arg Gin Leu Arg Glu Phe Leu Gly Thr Ala 

420 425 430 

Gly Leu Cys Arg Leu Trp lie Pro Gly Phe Ala Glu Met Ala Ala Pro 

435 440 445 

Leu Tyr Pro Leu Thr Lys Thr Gly Thr Leu Phe Lys Trp Gly Pro Asp 

450 455 460 

Gin Gin Lys Ala Tyr Gin Glu lie Lys Gin Ala Leu Leu Thr Ala Pro 
465 470 * 475 480 

Ala Leu Gly Leu Pro Asp Leu Thr Lys Pro Phe Glu Leu Phe Val Asp 

485 490 495 

Glu Lys Gin Gly Tyr Ala Lys Gly Val Leu Thr Gin Lys Leu Gly Pro 

500 505 510 

Trp Arg Arg Pro Val Ala Tyr Leu Ser Lys Lys Leu Asp Pro Val Ala 

515 520 525 

Ala Gly Trp Pro Pro Cys Leu Arg Met Val Ala Ala He Ala Val Leu 

530 535 540 

Thr Lys Asp Val Gly Lys Leu Thr Met Gly Gin Pro Leu Val He Leu 
545 * 550 555 560 

Ala Pro His Ala Val Glu Ala Leu Val Lys Gin Pro Pro Asp Arg Trp 
565 570 575 
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Leu 


Ser 


Asn 


Ala 


Arg 


Met 


Thr 


His 


Tyr 


Gin 


Ala Leu 


Leu Leu Asp 


Thr 








580 








585 






590 




Asp 


Arg 


Val 


Gin 


Phe 


Gly 


Pro 


He 


Val 


Ala 


Leu Asn 


Pro Ala Thr 


Leu 


595 










600 








605 




Leu 


Pro 


Leu 


Pro 


Glu 


Glu 


Gly 


Leu 


Gin 


His 


Asp Cys 


Leu Asp He 


Leu 




610 










615 








620 






Ala 


Glu 


Ala 


His 


Gly 


Thr 


Arg 


Pro 


Asp 


Leu 


Thr Asp 


Gin Pro Leu 


Pro 


625 










630 










635 




640 


Asp 


Ala 


Asp 


His 


Thr 


Trp 


Tyr 


Thr 


Asp 


Gly 


Ser Ser 


Phe Leu Gin 


Glu 






645 










650 




655 




Gly 


Gin 


Arg 


Arg 


Ala 


Gly 


Ala 


Ala 


Val 


Thr 


Thr Glu 


Thr Glu Val 


He 




660 










665 






670 




Trp 


Ala 


Lys 


Ala 


Leu 


Pro 


Ala 


Gly 


Thr 


Ser 


Ala Gin 


Arg Ala Glu 


Leu 




675 










680 








685 




lie 


Ala 


Leu 


Thr 


Gin 


Ala 


Leu 


Lys 


Met 


Ala 


Ala Gly 


Lys Lys Leu 


Asn 




690 










695 








700 






Val 


Tyr 


Thr 


Asp 


Ser 


Arg 


Tyr 


Ala 


Phe 


Ala 


Thr Ala 


His He His 


Gly 


705 








710 










715 




720 


Glu 


lie 


Tyr 


Arg 


Arg 


Arg 


Gly 


Leu 


Leu 


Thr 


Ser Glu 


Gly Lys Glu 


He 








725 










730 




735 




Lys 


Asn 


Lys 


Asp 


Glu 


He 


Leu 


Ala 


Leu 


Leu 


Lys Ala 


Leu Phe Leu 


Pro 




740 










745 






750 




Lys 


Arg 


Leu 


Ser 


He 


He 


His 


Cys 


Pro 


Gly 


His Gin 


Lys Gly Asn 


His 


755 










760 








765 




Ala 


Glu 


Ala 


Arg 


Gly 


Asn 


Arg 


Met 


Ala 


Asp 


Gin Ala 


Ala Arg Glu 


Val 




770 










775 








780 






Ala 


Thr 


Arg 


Glu 


Thr 


Pro 


Glu 


Thr 


Ser 


Thr 


Leu Leu 


He Glu Asn 


Ser 


785 








790 










795 




800 


Ala 


Pro 


Tyr 


Thr 


Arg 


Glu 


His 


Phe 


His 


Tyr 


Thr Val 


Thr Asp He 


Lys 








805 










810 




815 




Asp 


Leu 


Thr 


Lys 


Leu 


Gly 


Ala 


Thr 


Tyr 


Asp 


Asp Ala 


Lys Lys Cys 


Trp 






820 










825 






830 




Val 


Tyr 


Gin 


Gly 


Lys 


Pro 


Val 


Met 


Pro 


Asp 


<31n Phe 


Thr Phe Glu 


Leu 




835 






84 0 








845 




Leu 


Asp 


Phe 


Leu 


His 


Gin 


Leu 


Thr 


His 


Leu 


Ser Phe 


Ser Lys Thr 


Lys 




850 










855 








860 






Ala 


Leu 


Leu 


Glu 


Arg 


Ser 


Tyr 


Ser 


Pro 


Ser 


Tyr Met 


Leu Asn Arg 


Asp 


865 








870 










875 




880 


Arg 


Thr 


Leu 


Lys 


Asp 


He 


Thr 


Glu 


Thr 


Cys 


Lys Ala 


Cys Ala Gin 


Val 








885 










890 




895 




Asn 


Ala 


Ser 


Lys 


Ser 


Ala 


Val 


Lys 


Gin 


Gly 


Thr Arg 


Val Arg Gly 


His 








900 










905 






910 




Arg 


Pro 


Gly 


Thr 


His 


Trp 


Glu 


He 


Asp 


Phe 


Thr Glu 


Val Lys Pro 


Gly 




915 










920 








925 




Leu 


Tyr 


Gly 


Tyr 


Lys 


Tyr 


Leu 


Leu 


Val 


Phe 


Val Asp 


Thr Phe Ser 


Gly 




930 










935 








940 






Trp 


Val 


Glu 


Ala 


Phe 


Pro 


Thr 


Lys 


Lys 


Glu 


Thr Ala 


Lys val Val 


Thr 


945 










950 










955 




960 


Lys 


Lys 


Leu 


Leu 


Glu 


Glu 


He 


Phe 


Pro 


Arg 


Phe Gly 


Met Pro Gin 


Val 






965 










970 




975 




Leu 


Gly 


Thr 


Asp 


Asn 


Gly 


Pro 


Ala 


Phe 


Val 


Ser Lys 


Val Ser Gin 


Thr 






980 










985 






990 




Val 


Ala 


Asp 


Leu 


Leu 


Gly 


Val 


Asp 


Trp 


Lys 


Leu His 


Cys Ala Tyr 


Arg 






995 








1000 






1005 




Pro 


Gin 


Ser 


Ser 


Gly 


Gin 


Val 


Glu 


Arg 


Met 


Asn Arg 


Thr He Lys 


Glu 


1010 






1015 








1020 






Thr 


Leu 


Thr 


Lys 


Leu 


Thr 


Leu 


Ala 


Thr 


Gly 


Ser Arg Asp Trp Val 


Leu 


025 






1030 








1035 


1040 


Leu 


Leu 


Pro 


Leu 


Ala 


Leu 


Tyr 


Arg 


Ala 


Arg Asn Thr Pro Gly Pro His 








1045 








1050 




1055 




Glv Leu 


Thr 


Pro 


Tyr 


Glu 


He 


Leu 


Tyr Gly Ala Pro 


Pro Pro Leu 


Val 






1060 






1065 






1070 




Asn 


Phe 


Pro 


Asp 


Pro Asp 


Met 


Ala 


Lys 


Val 


Thr His 


Asn Pro Ser 


Leu 



1075 " 1080 1085 

Gin Ala His Leu Gin Ala Leu Tyr Leu Val Gin His Glu Val Trp Arg 



1090 1095 HOO 

Pro Leu Ala Ala Ala Tyr Gin Glu Gin Leu Asp Arg Pro Val Val Pro 
105 1H0 1115 1120 

His Pro Phe Arg Val Gly Asp Thr Val Trp Val Arg Arg His Gin Thr 
1125 H30 1135 
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Lys Asn Leu Glu Pro Arg Trp Lys Gly Pro Tyr Thr Val Leu Leu Thr 

1140 1145 1150 

Thr Pro Thr Ala Leu Lys Val Asp Gly lie Ala Ala Trp lie His Ala 

1155 1160 1165 

Ala His Val Lys Ala Ala Asp Thr Arg lie Glu Pro Pro Ala Glu Ser 

1170 1175 1180 

Thr Trp Arg Val Gin Arg Ser Gin Asn Pro Leu Lys lie Arg Leu Thr 
185 1190 1195 1200 

Arg Gly Thr Ser 

1 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 0 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 





(vi) ORIGINAL 


SOURCE : 


















(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO:30: 






Met 


Asp 


Ser 


Ala 


Aia 


Pro 


Aia 


Leu 


oer 


Pro 


ax a 


Leu 


Thr Ala Leu 


Thr 


1 






5 










10 






15 




Asp 


(iin 


Ser 


a j. a 


1 LIT 


fcl Si 


Asp 


Leu 


nla 


Tip 

lie 




He 


Pro Lys Cys 


Pro 






"5 A 


















30 




Asp 


Fro 


ulU 


Arg 


Tyr 




Tyr 


i. Iii 


Cor 


OX 11 


cys 


Pro Asp He Asn 


His 






35 










H 0 










45 




Leu 


Arg 


Ser 


Leu 


ser 


lie 


Leu 


Asn 


Arg 


Trp 


Leu 


Glu 


Thr Glu Leu 


Val 




50 










c c 

3D 










60 






Phe 


Val 

v ax 


Glv 


Asp 


Glu 


Glu 


Asp 


Val 


Ser 


Lvs 


Leu 


Ser Glu Gly Glu Leu 


65 






70 










75 






80 


Ser 


Phe 


Tyr 


Arg 


Phe 


Leu 


Phe 


Ala 


Phe 


Leu 


Ser 


Ala Ala Asp Asp 


Leu 






85 










90 






95 




Val 


Thr 


Glu 


Asn 


Leu 


Gly 


Gly 


Leu 


Ser 


Gly 


Leu 


Phe 


Glu Gin Lys 


Asp 








100 










105 








110 




He 


Leu 


His 


Tyr 


Tyr 


Val 


Glu 


Gin 


Glu 


Cys 


He 


Glu 


Val Val His 


Ser 






115 






120 










125 




Arg 


Val 


Tyr 


Asn 


He 


He 


Gin 


Leu 


Val 


Leu 


Phe 


His 


Asn Asn Asp 


Gin 


130 








135 










140 






Ala 


Arg 


Arg 


Glu 


Tyr 


Val 


Ala 


Gly 


Thr 


He 


Asn 


His 


Pro Ala lie 


Arg 


145 






150 










155 






160 


Ala 


Lys 


Val 


Asp 


Trp 


Leu 


Glu 


Ala 


Arg 


Val 


Arg 


Glu 


Cys Ala Ser 


Val 






165 










170 






175 




Pro 


Glu 


Lys 


Phe 


He 


Leu 


Met 


He 


Leu 


He 


Glu 


Gly 


He Phe Phe 


Ala 






180 










185 








190 




Ala 


Ser 


Phe 


Ala 


Ala 


He 


Ala 


Tyr 


Leu 


Arg 


Thr 


Asn 


Asn Leu Leu Arg 






195 










200 










205 




Val 


Thr 


Cys 


Gin 


Ser 


Asn 


Asp 


Leu 


He 


Ser 


Arg 


Asp 


Glu Ala Val 


His 




210 








215 










220 






Thr 


Thr 


Ala 


Ser 


Cys 


Tyr 


He 


Tyr 


Asn 


Asn 


Tyr 


Leu Gly Gly His Ala 


225 








230 










235 






240 


Lys 


Pro 


Pro 


Pro 


Asp 


Arg 


Val 


Tyr 


Gly 


Leu 


Phe 


Arg 


Gin Ala Val 


Glu 








245 










250 






255 




He 


Glu 


He 


Gly 


Phe 


He 


Arg 


Ser 


Gin 


Ala 


Pro 


Thr Asp Ser His 


He 








260 










265 








270 




Leu 


Ser 


Pro 


Ala 


Ala 


Leu 


Ala 


Ala 


He 


Glu 


Asn 


Tyr Val Arg Ph 


Ser 






275 










280 










265 




Ala 


Asd 


Arg 


Leu 


Leu 


Gly 


Leu 


He 


His 


Met 


Lys 


Pro 


Leu Phe Ser 


Ala 




290 








295 










300 






Pro 


Pro 


Pro 


Asp 


Ala 


Ser 


Phe 


Pro 


Leu 


Ser 


Leu 


Met 


Ser Thr Asp 


Lys 


305 








310 










315 






320 


His 


Thr 


Asn 


Phe 


Phe 


Glu 


Cys 


Arg 


Ser 


Thr 


Ser 


Tyr Ala Gly Ala Val 










325 










330 






335 




Val 


Asn 


Asp 


Leu 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
<iii) HYPOTHETICAL: NO 

<iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



Met 


Asp 


Pro Ala 


Val 


Ser 


Pro 


Ala 


Ser 


Thr Asp Pro 


Leu Asp 


Thr 


His 


1 






5 










10 






15 




Ala 


Ser 


Gly Ala 


Gly 


Ala 


Ala 


Pro 


He 


Pro Val Cys 


Pro 


Thr 


Pro 


Glu 






20 










25 






30 






Arg 


Tyr 


Phe Tyr 


Thr 


Ser 


Gin 


Cys 


Pro 


Asp He Asn 


His 


Leu Arg 


Ser 






35 








40 






45 








Leu 


Ser 


He Leu 


Asn 


Arg 


Trp 


Leu 


Glu 


Thr Glu Leu 


Val 


Phe 


Val 


Gly 




50 








55 






60 










Asp 


Glu 


Glu Asp 


Val 


Ser 


Lys 


Leu 


Ser 


Glu Gly Glu 


Leu Gly Phe 


Tyr 


65 








70 








75 








80 


Arg 


Phe 


Leu Phe 


Ala 


Phe 


Leu 


Ser 


Ala 


Ala Asp Asp 


Leu 


Val 


Thr 


Glu 








85 










90 






95 




Asn 


Leu 


Gly Gly 


Leu 


Ser 


Gly 


Leu 


Phe 


Glu Gin Lys 


Asp 


He 


Leu 


His 






100 










105 






110 






Tyr 


Tyr 


Val Glu 


Gin 


GlU 


Cys 


He 


Glu 


Val Val His 


Ser Arg Val 


Tyr 






115 








120 






125 








Asn 


He 


He Gin 


Leu 


Val 


Leu 


Phe 


His 


Asn Asn Asp 


Gin Ala Arg 


Arg 




130 








135 






140 










Ala 


Tyr 


Val Ala 


Arg 


Thr 


He 


Asn 


His 


Pro Ala He 


Arg Val 


Lys 


Val 


145 








150 








155 








160 


Asp 


Trp 


Leu Glu 


Ala 


Arg 


Val 


Arg 


Glu 


Cys Asp Ser 


He 


Pro 


Glu 


Lys 








165 










170 






175 




Phe 


He 


Leu Met 


He 


Leu 


He 


Glu 


Gly 


Val Phe Phe 


Ala 


Ala 


Ser 


Phe 






180 










185 






190 






Ala 


Ala 


He Ala 


Tyr 


Leu 


Arg 


Thr 


Asn 


Asn Leu Leu 


Arg 


Val 


Thr 


Cys 






195 








200 






205 








Gin 


Ser 


Asn Asp 


Leu 


He 


Ser 


Arg 


Asp 


Glu Ala Val 


His 


Thr 


Thr 


Ala 




210 








215 






220 










Ser 


Cys 


Tyr He 


Tyr 


Asn 


Asn 


Tyr 


Leu 


Gly Gly His 


Ala 


Lys 


Pro 


Glu 


225 








230 








235 








240 


Ala 


Ala 


Arg Val 


Tyr 


Arg 


Leu 


Phe 


Arg 


Glu Ala Val 


Asp 


He 


Glu 


He 








245 










250 






255 




Gly 


Phe 


He Arg 


Ser 


Gin 


Ala 


Pro 


Thr 


Asp Ser Ser 


He 


Leu 


Ser 


Pro 




260 










265 






270 






Gly 


Ala 


Leu Ala 


Ala 


He 


Glu 


Asn 


Tyr 


Val Arg Phe 


Ser 


Ala 


Asp 


Arg 




275 








280 






285 








Leu 


Leu 


Gly Leu 


He 


His 


Met 


Gin 


Pro 


Leu Tyr Ser 


Ala 


Pro 


Ala 


Pro 




290 






295 






300 










Asp 


Ala 


Ser Phe 


Pro 


Leu 


Ser 


Leu 


Met 


Ser Thr Asp 


Lys 


His 


Thr 


Asn 


305 








310 








315 








320 


Phe 


Phe 


Glu Cys 


Arg 


Ser 


Thr 


Ser 


Tyr 


Ala Gly Ala 


Val 


Val 


Asn 


Asp 






325 










330 






335 




Leu 



























(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 302 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



SUBSTITUTE SHEET (RULE 26) 



WO 97/01578 



PCTYUS96/10958 



•96- 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 





(xi) SEQUENCE 


DESCRIPTION; 


: SEQ ID 


NO:32: 


Met 


Ser 


Lys 


Leu 


Leu 


Tyr 


Val 


Arg 


Asp 


His 


Glu Gly Phe Ala Cys Leu 








5 










10 


15 




Val 


Glu 


Thr 


His 


Arg 


Asn 


Arg 


Trp 


Phe 


Ala Ala His He Val Leu 








20 














30 


Thx 


Lys 


Asp 


Cys 


Gly 


Cys 


Leu 


Lys 


Leu 


Leu 


Asn Glu Arg Asp Leu Glu 




35 










40 






45 


Phe 


Tyr 


Lys 


Phe 


Leu 


Phe 


Thr 


Phe 


Leu 


Ala 


Met Ala Glu Lys Leu Val 




50 








55 








60 


/tail 


Phe 


Asn 


He 


Asp 


Glu 


Leu 


Val 


Thr 


Ser 


Phe Glu Ser His Asp He 


65 








70 










75 80 


Asp 


His 


Tyr 


Tyr 


Thr 


Glu 


Gin 


Lys 


Ala 


Met 


Glu Asn Val His Gly Glu 




85 










90 


95 


xnr 


Tyr 


Ala 


Asn 


He 


Leu 


Asn 


Met 


Leu 


Phe 


Asn Glv Asd Arcr Ala Ala 






100 










105 




110 


Met 


Asn 


Ala 


Tyr 


Ala 


Glu 


Ala 


He 


Met 


Ala 


Asp Glu Ala Leu Gin Ala 






115 








120 






125 


Lys 


He 


Ser 


Trp 


Leu 


Arg 


Asp 


Lys 


val 


Ala 


Ala Ala Val Thr Leu Pro 


130 










135 








140 


Glu 


Lys 


He 


Leu 


Val 


Phe 


Leu 


Leu 


He 


Glu 


fil v Tie Phe Phe He Ser 

uiy lie lie rue mwj- 


145 








150 










155 160 


Ser 


Phe 


Tyr 


Ser 


He 


Ala 


Leu 


Leu 


Arg 


Val 


A-rri rjl \r T,m l Met Pro Glv 
niy uxy ricu nw j 








165 










170 


175 


ne 


Cys 


Leu 


Ala 


Asn 


Asn 


Tyr 


He 


Ser 


Arg 


Asp Glu Leu Leu His Thr 






180 










185 




190 


Arg 


Ala 


Ala 


Ser 


Leu 


Leu 


Tyr 


Asn 


Ser 


Met 


Thr Ala Lvs Ala Asp Arg 




195 










200 






205 


Pro 


Arg 


Ala 


Thr 


Trp 


He 


Gin 


Glu 


Leu 


Phe 


Arg Thr Ala Val Glu Val 




210 










215 








220 


Glu 


Thr 


Ala 


Phe 


He 


Glu 


Ala 


Arg 


Gly 


Glu 


Gly Val Thr Leu Val Asp 


225 










230 










235 240 


Val 


Arg 


Ala 


He 


Lys 


Gin 


Phe 


Leu 


Glu 


Ala 


Thr Ala Asp Arg He Leu 








245 










250 


255 


Gly 


Asp 


He 


Gly 


Gin 


Ala 


Pro 


Leu 


Tyr 


Gly 


Thr Pro Pro Pro Lys Asp 




260 










265 




270 


Cys 


Pro 


Leu 


Thr 


Tyr 


Met 


Thr 


Ser 


He 


Lys 


Gin Thr Asn Phe Phe Glu 




275 










280 






285 


Gin 


Glu 


Ser 


Ser 


Asp 


Tyr 


Thr 


Met 


Leu 


Val 


Val Asp Asp Leu 



290 295 300 



(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Leu Ser Leu Arg Val Pro Leu Ala Pro lie Thr Asp Pro Gin Gin 

15 10 15 

Leu Gin Leu Ser Pro Leu Lys Gly Leu Ser Leu Val Asp Lys Glu Asn 

20 25 30 

Thr Pro Pro Ala Leu Ser Gly Thr Arg Val Leu Ala Ser Lys Thr Ala 

35 40 45 

Arg Arg lie Phe Gin Glu Pro Thr Glu Pro Lys Thr Lys Ala Ala Ala 
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50 










jj 








60 


Pro 


Gly 


Val 


Glu Asp 




Pro* 


Leu 


T Oil A TTT 


Glu 


a cm Pro Arcr Arcr Phe 


65 








/ LI 








75 


80 


Val 


lie 


Pne 


Pro 


He 


uiU 


Tyr 


Mi «; 
nx 0 


21 cn Tie 
ridL) xx c 




Gin Me t Tvr L vs Lvs 










85 








90 




95 


Ala 


Glu 


nit 

Ala 


Ser 


Phe 


Trp 


1 Hi. 


Ala 


V31U wXU 


Val 


Asp Leu Ser Lys Asp 








100 








105 




110 


lie 


Gin 


His 


Trp 


Glu 


Ser 


Leu 


Lys 


Pro Glu 


L3X Li 


IVyvt Tvr Php Tip Qpr 

A-XLJ 1 yi XrilC XXC »J w X 






115 








xx u 






125 


His 


Val 


Leu 


Ala 


Phe 


Fne 


iixci 


>vJLcl 


C* i-v -y 7\ on 
O CI .rlojJ 




T1p VaI A^n Glu Asn 

XXC VOX Awll wX IX 




130 










17c 
IJ j 








140 


Leu 


Val 


Glu 


Arg 


Phe 


Ser 


r*l n 


UX Li 


VcLX OX11 


Tip 
X X c 


Thr* Glu Ala Arcr Cvs 


145 








150 








155 


160 


Pne 


Tyr 


Gly Phe Gin 


Tl a 
JL le 


Ala 


Met 


Tii lien 

L3XLL noli 


He 


His Ser Glu Met Tvr 








165 








170 




175 


Ser 


Leu 


Leu 


He 


Asp 


xXlx 


Tyr 


He 


XJ J O 


Pro 


Lys Glu Arg Glu Phe 








180 








185 




190 


Leu 


Pne 


Asn 


Ala 


He 




Thr 


Met 




Val 
v ax 


Lvs Lvs Lvs Ala ASD 






195 










200 






205 


Trp 


Ala 


Leu Arg 


Trp 


Tl a 


Gly Asp 


T.ve f2l 11 
ijyo ox u 


Ala 


Thr Tvr Glv Glu Arcr 


210 










215 








220 


Val 


Val 


Ala 


Phe 


Ala 


Aid 


Val 


Glu 


biy lie 


IT XitT 


Phe Ser Glv Ser Phe 


225 


















235 


240 


Ala 


Ser 


He 


Phe 


Trp 


Leu 


Lys 


Lys 


& y /"I 0 1 \r 

ntg oiy 


XjCU 


Met Pro Glv Leu Thr 

lie? Lp r wxy 1* x 4Ai> 










245 












255 


Phe 


Ser 


Asn 


Glu 


Leu 


lie 


Ser Arg 




vjxy 


T,pn Hi ^ Cvs Asn Phe 

UCU ixXS LyO n0^* 






260 














270 


Ala 


Cys 


Leu 


Met 


Phe 


Lys 


His 


Leu 


val nlS 


Lys 


0*-/% Cat- nin m 11 XVt-er 
fro acx v?xu oiu niy 




275 










280 








Val 


Arg 


Glu 


He 


He 


He 


Asn 


Ala 


Val Arg 


T 1 A 

lie 


blu uin ulu file ucu 




290 










295 








J w W 


Thr 


Glu 


Ala 


Leu 


Pro 


Val 


Lys 


Leu 


He Gly 


Met 


Asn Lrys 1 nr Leu ne u 


305 










310 








315 


320 


Lys 


Gin 


Tyr 


He 


Glu 


Phe 


Val 


Ala 


/*x y 


Leu 


Met Leu Glu Leu Glv 






325 








330 




335 


Pne 


ber 


Lys 


Val 


Phe Arg 


Val 


Glu 


Asn Pro 


Phe 


Asp Phe Met Glu Asn 






340 










345 




350 


He 


Ser 


Leu Glu Gly Lys 


Thr 


Asn 


Phe Phe 


Glu 


Lys Arg Val Gly Glu 






355 










360 






365 


Tyr 


Gin 


Arg Met Gly Val 


Met 


Ser 


Ser Pro 


Thr 


Glu Asn Ser Phe Thr 


370 










375 








380 


Leu 


Asp 


Ala 


Asp 


Phe 














385 























(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Glu Pro He Leu Ala Pro Asn Pro Asn Arg Phe Val He Phe Pro 

1 5 10 15 

He Gin Tyr Tyr Asp He Trp Asn Met Tyr Lys Lys Ala Glu Ala Ser 

20 25 30 

Phe Trp Thr Val Glu Glu Val Asp He Ser Lys Asp He Asn Asp Trp 

35 40 45 

Asn Lys Leu Thr Pro Asp Glu Lys Tyr Phe He Lys His Val Leu Ala 

50 55 60 

Phe Phe Ala Ala Ser Asp Gly He Val Asn Glu Asn Leu Ala Glu Arg 
65 70 75 80 
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Thr 


Glu 


Val 


Gin 


He 


Thr 


Glu 


Ala 


Arg 


Cys Phe Tyr 


Glv 


Phe 








85 










90 






95 




Gin 


Met 


Ala 


He 


Glu 


Asn 


He 


His 


Ser 


Glu 


Met 


Tyr Ser Leu 


Leu 


He 








100 










105 






110 






ASp 


lux 


1 VA 


Val 

v ax 




Asp 


Ser 


Asn 


Glu 


Lvs 


Asn Tyr Leu Phe 


Asn 


Ala 




lis 










120 








125 






lie 


ulu 


1 XI XT 


ncL 


Drn 




Val 


Lvs 


Lvs 


Lys 


Ala Asp Trp Ala 


Gin 


Lvs 




1 ^0 
uu 










135 










140 






Trp 


lie 


Hi e 

rixs 


Asp 


ser 


Ala 


Gly 


Tvr 


Glv 


Glu 


Arg 


Leu He Ala 


Phe 


Ala 


145 








150 










155 






160 


Ala 


Val 


rl T » 
Ul u 


fil V 
u A. y 


He 


Phe 


Phe 


Ser 


Glv 


Ser 


Phe 


Ala Ser He 


Phe 


Trp 








165 










170 






175 






Lys 


Lys 


A*TCT 


Glv 

wx y 


Leu 


Met 


Pro 


Gly 


Leu 


Thr 


Phe Ser Asn 


Glu 


Leu 




180 










185 






190 






xxc 


Ser 




Asp 


Glu 


GlV 


Leu 


His 


Cys 


Asp 


Phe Ala Cys Leu 


Met 


Phe 






195 








200 








205 






T A/C 

Xjjt s 


Hi 


Leu 


Leu 


His 


Pro 


Pro 


Ser 


Glu 


Glu 


Thr 


Val Arg Ser 


He 


He 












215 










220 






inx 


Asp 


Ala 


v ax 




He 


Glu 


Gin 


Glu 


Phe 


Leu 


Thr Ala Ala 


Leu 


Pro 


225 








230 










235 






240 


Veil 


Lys 


Leu 


lie 


uiy 


Mot* 


A en 




Glu 


Met 


Met 


Lys Thr Tyr 


He 


Glu 


















250 






255 




Phe 


Val 


Ala 


Asp 


Arg 


Leu 


He 


Ser 


Glu 


Leu 


Gly 


Phe Lys Lys 


He 


Tyr 








260 








265 






270 






Asn 


Val 


Thr 


Asn 


Pro 


Phe 


Asp 


Phe 


Met 


Glu 


Asn 


He Ser Leu 


Glu 


Gly 






275 










280 








285 






Lys 


Thr 


Asn 


Phe 


Phe 


Glu 


Lys 


Arg 


val 


Gly 


Glu Tyr Gin Lys 


Met 


Glv 


290 










295 










300 






val 


Met 


Ser 


Gin 


Glu 


Asp 


Asn 


His 


Phe 


Ser 


Leu Asp Val Asp 


Phe 




*S A C 










310 










315 












(2) 


INFORMATION FOR SEQ ID 


NO:35: 










(i) SEQUENCE CHARACTERISTICS: 
















(A) 


LENGTH: 


390 


amino acids 
















(B) 


TYPE : amino 


acid 


















(C) 


STRANDEDNESS : single 
















(D) 


TOPOLOGY : 1 inear 
















(ii) MOLECULE 


TYPE : peptide 














(iii) 


HYPOTHETICAL: NO 
















<iv) ANT I SENSE : NO 


















(v) FRAGMENT TYPE: 


: internal 














(vi) ORIGINAL 


SOURCE : 


















(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 35: 






Met 


Leu 


Ser 


Val 


Arg 


Thr 


Pro 


Leu 


Ala 


Thr 


He 


Ala Asp Gin 


Gin 


Gin 


1 








5 










10 






15 




Leu 


Gin 


Leu 


Ser 


Pro 


Leu 


Lys 


Arg 


Leu 


Thr 


Leu 


Ala Asp Lys 


Glu 


Asn 








20 










25 






30 






Thr 


Pro 


Pro 


Thr 


Leu 


Ser 


Ser 


Thr 


Arg 


Val 


Leu 


Ala Ser Lys 


Ala 


Ala 






35 










40 








45 






Arg 


Arg 


He 


Phe 


Gin 


Asp 


Ser 


Ala 


Glu 


Leu 


Glu 


Ser Lys Ala 


Pro 


Thr 


50 










55 










60 






Asn 


Pro 


Ser 


Val 


Glu 


Asp 


Glu 


Pro 


Leu 


Leu 


Arg 


Glu Asn Pro 


Arc 


Arg 


65 










70~ 










75 






80 


Phe 


Val 


Val 


Phe 


Pro 


He 


Glu 


Tyr 


His 


Asp 


He 


Trp Gin Met 


Tvr 


Lys 








85 










90 






95 




Lys 


Ala 


Glu 


Ala 


Ser 


Phe 


Trp 


Thr 


Ala 


Glu 


Glu 


Val Asp Leu 


Ser 


Lvs 






100 










105 






110 






Asp 


lie 


Gin 


His 


Trp 


Glu 


Ala 


Leu 


Lys 


Pro 


Asp 


Glu Arg His 


Phe 


He 




115 








120 








125 




Glu 


Ser 


His 


Val 


Leu 


Ala 


Phe 


Phe 


Ala 


Ala 


Ser 


Asp 


Gly He Val 


Asn 




130 










135 










140 






Asn 


Leu 


Val 


Glu 


Arg 


Phe 


Ser 


Gin 


Glu 


Val 


Gin 


Val Thr Glu 


Ala 


Arg 


145 








150 










155 






160 


Cys 


Phe 


Tyr 


Gly 


Phe 


Gin 


He 


Ala 


Met 


Glu 


Asn 


He His Ser 


Glu 


Met 




165 










170 






175 




Tyr 


Ser 


Leu 


Leu 


He 


Asp 


Thr 


Tyr 


lie 


Lys 


Asp 


Pro Lys Glu 


Arg 


Glu 
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180 










185 






190 






Tyr 


Leu 


Phe 


Asn 


Ala 


He 


Glu 


Thr 


Met 


Pro 


Cys Val Lys 


Lys 


Lys 


Ala 




195 










200 






2 05 








Asp 


Trp 


Ala 


Leu 


Arg 


Trp 


He 


Gly 


Asp 


Lys 


Glu Ala Thr 


Tyr 


Gly 


Glu 


210 










215 
















Arg 


Val 


Val 


Ala 


Phe 


Ala 


Ala 


Val 


Glu 


Gly 


He Pne Pne 


Ser 


uiy 


Ser 


225 










230 










O *1 c 








Phe 


Ala 


Ser 


He 


Phe 


Trp 


Leu 


Lys 


Lys 


Arg 


Gly Leu Met 


Pro 


«j.y 


Leu 










245 










250 










Thr 


Phe 


Ser 


Asn 


Glu 


Leu 


He 


Ser 


Arg 


Asp 


Glu Gly Leu 


IllS 


Cys 


Asp 








260 










o c c 






O *7f1 
Z / U 






Phe 


Ala 


Cys 


Leu 


Met 


Phe 


Lys 


His 


Leu 


Val 


His Lys Pro 


Ala 


GlU 


Gin 






275 










280 






i a e 
285 








Arg 


val 


Arg 


Glu 


He 


He 


Thr 


Asn 


Ala 


Val 


Arg He Glu 


Gin 


Glu 


Phe 


290 








295 








300 








Leu 


Thr 


Glu 


Ala 


Leu 


Pro 


Val 


Lys 


Leu 


He 


Gly Met Asn 


Cys 


Thr 


Leu 


305 










310 










315 






320 


Met 


Lys 


Gin 


Tyr 


He 


Glu 


Phe 


Val 


Ala 


Asp 


Arg Leu Met 


Leu 


Glu 


Leu 






325 










330 










Gly 


Phe 


Asn 


Lys 


He 


Phe 


Arg 


17-* T 

Val 


Glu 


Asn 


pro pne Asp 


pne 


rlcL 


ni ii 

ulu 






340 










345 






350 






Asn 


lie 


Ser 


Leu 


Glu 


Gly 


Lys 


Thr 


Asn 


Phe 


Phe Glu Lys 


Arg 


Val 


Gly 






355 










360 






365 








Glu 


Tyr 


Gin 


Arg 


Met 


Gly 


Val 


Met 


Ser 


Asn 


Ser Thr Glu 


Asn 


Ser 


Phe 




370 










375 








380 








Thr 


Leu 


Asp 


Ala 


Asp 


Phe 


















385 










3 90 



















(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL; NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



Met 


Pro 


Lys 


Glu 


Thr 


Pro 


Ser 


Lys 


Ala 


Ala 


Ala 


Asp Ala 


Leu 


Ser 


Asp 


1 






5 










10 








15 




Leu 


Glu 


He 


Lys 
20 


Asp 


Ser 


Lys 


Ser 


Asn 
25 


Leu 


Asn 


Lys Glu 


Leu 
30 


Glu 


Thr 


Leu 


Arg 


Glu 


Glu 


Asn 


Arg 


Val 


Lys 


Ser 


Asp 


Met 


Leu Lys 


Glu 


Lys 


Leu 




35 










40 








45 








Ser 


Lys 


Asp 


Ala 


Glu 


Asn 


His 


Lys 


Ala 


Tyr 


Leu 


Lys Ser 


His 


Gin 


Val 




50 








55 










60 








His 


Arg 


His 


Lys 


Leu 


Lys 


Glu 


Met 


Glu 


Lys 


Glu 


Glu Pro 


Leu 


Leu 


Asn 


65 






70 










75 








60 


Glu 


Asp 


Lys 


Glu 


Arg 


Thr 


Val 


Leu 


Phe 


Pro 


He 


Lys Tyr 


His 


Glu 


He 






65 










90 








95 




Trp 


Gin 


Ala 


Tyr 


Lys 


Arg 


Ala 


Glu 


Ala 


Ser 


Phe 


Trp Thr 


Ala 


Glu 


Glu 






100 










105 








110 




Glu 


He 


Asp 


Leu 


Ser 


Lys 


Asp 


He 


His 


Asp 


Trp 


Asn 


Asn Arg 


Met 


Asn 




115 










120 








125 








Asn 


Glu 


Arg 


Phe 


Phe 


He 


Ser 


Arg 


Val 


Leu 


Ala 


Phe Phe 


Ala 


Ala 


Ser 




130 








135 










140 








Asp 


Gly 


He 


Val 


Asn 


Glu 


Asn 


Leu 


Val 


Glu 


Asn 


Phe Ser 


Thr 


Glu 


Val 


145 








150 










155 






He 


160 


Gin 


He 


Pro 


Glu 


Ala 


Lys 


Ser 


Phe 


Tyr 


Gly 


Phe 


Gin He 


Met 


Glu 








165 








170 








175 




Asn 


He 


His 


Ser 


Glu 


Thr 


Tyr 


Ser 


Leu 


Leu 


He 


Asp Thr 


Tyr 


He 


Lys 






180 








185 








190 






Asp 


Pro 


Lys 


Glu 


Ser 


Glu 


Phe 


Leu 


Phe 


Asn 


Ala 


He His 


Thr 


lie 


Pro 




195 










200 








205 
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Glu 


He 


Glv 


Glu 


Lvs 


Ala 


Glu 


Trp 


Ala 


Leu 


Arg 


Trp 


He Gin Asp 


Ala 




210 








215 










220 








Asp 


Ala 


Leu 


Phe 


Gly 


Glu 


Arg 


Leu 


Val 


Ala 


Phe 


Ala 


S r He 


Glu 


Gly 


225 








230 










235 








240 


Val 


Phe 


Phe 


Ser 


Gly 


Ser 


Phe 


Ala 


Ser 


He 


Phe 


Trp 


Leu Lys 


Lys 


Arg 










245 










250 








255 




GlV 


Met 


Met 


Pro 


Gly 


Leu 


Thr 


Phe 


Ser 


Asn 


Glu 


Leu 


lie Cys 


Arq 


Asp 






260 










265 








270 






Glu 


Glv 


Leu 


His 


Thr 


Asd 


Phe 


Ala 


Cvs 


Leu 


Leu 


Phe 


Ala His 


Leu 


Lys 




275 










280 










285 






Asn 


Lys 


Pro 


Asp 


Pro 


Ala 


lie 


Val 


Glu 


Lvs 


He 


Val 


Thr Glu 


Ala 


Val 




290 










295 










300 








Glu 


He 


Glu 


Gin 




Tvr 


Phe 


Leu 


Asp 


Ala 


Leu 


Pro 


Val Ala 


Leu 


Leu 


305 








310 










315 








320 


Glv 


Met 


Asn 


Ala 


Asd 


Leu 


Met 


Asn 


Gin 


Tyr 


Val 


Glu 


Phe Val 


Ala 


Asp 








325 










330 








335 




Arg 


Leu 


Leu 


Val 


Ala 


Phe 


Gly 


Asn 


Lys 


Lys 


Tyr 


Tyr 


Lys Val 


Glu 


Asn 






340 










345 








350 






Pro 


Phe 


Asp 


Phe 


Met 


Glu 


Asn 


He 


Ser 


Leu 


Ala Gly 


Lys Thr 


Asn 


Phe 






355 










360 










365 






Phe 

tr lie 


Glu 


Lys 


Arg 


Val 


Ser 


Asp 


Tyr 


Gin 


Lys 


Ala Gly 


Val Met 




Lys 




370 










375 










380 








Cot* 
OCX 


Thr 


Lys 


Gin 


Glu 


Ala 


Gly 


Ala 


Phe 


Thr 


Phe 


Asn 








IOC 
JOS 








390 










395 














(2) 


INFORMATION FOR SEQ ID 


NO:37: 












(i) SEQUENCE CHARACTERISTICS: 


















(A) 


LENGTH: 


375 


amino acids 


















(B) 


TYPE : amino 


acid 




















<C> 


STRANDEDNESS : single 


















(D) 


TOPOLOGY : 1 inear 


















(ii) MOLECULE 


TYPE: peptide 
















(iii) 


HYPOTHETICAL: NO 


















(iv) ANT I SENSE : NO 




















(v) FRAGMENT TYPE: 


: internal 
















(vi) ORIGINAL 


SOURCE : 




















(xi) SEQUENCE 


DESCRIPTION: 


; SEQ ID 


NO: 37: 








Rl a 


Tyr 


Thr 


Thr 


Phe 


Ser 


Gin 


Thr 


Lys 


Asn 


Asp 


Gin 


Leu Lys 


Glu 


Pro 


X 






5 










10 








15 




Mat- 
Wet. 


Phe 


Phe 


Gly 


Gin 


Pro 


val 


Asn 


Val 


Ala 


Arg 


Tyr 


Asp Gin Gin 


Lvs 








20 










25 








30 






xyr 


Asp 


He 


Phe 


Glu 


Lys 


Leu 


He 


Glu 


Lys 


Gin 


Leu 


Ser Phe 


Phe 


Trp 


35 










40 










45 






Arg 


Pro 


Glu 


Glu 


Val 


Asp 


Val 


Ser 


Arg 


Asp 


Arg 


He 


Asp Tyr 


Gin 


Ala 


50 










55 










60 








Leu 


Pro 


Glu 


His 


Glu 


Lys 


His 


He 


Phe 


He 


Ser 


Asn 


Leu Lys 


Tyr 


Gin 


O D 










70 










75 








80 


inr 


Leu 


Leu 


Asp 


Ser 


He 


Gin 


Gly 


Arg 


Ser 


Pro 


Asn 


Val Ala 


Leu 


Leu 








65 










90 








95 




Pro 


Leu 


He 


Ser 


He 


Pro 


Glu 


Leu 


Glu 


Thr 


Trp 


Val 


Glu Thr 


Trp 


Ala 








100 










105 








110 






riie 


Ser 


Glu 


Thr 


He 


His 


Ser 


Arg 


Ser 


Tyr 


Thr 


His 


lie lie 


Arg 


Asn 






115 










120 










125 






lie 


Val 


Asn 


Asp 


Pro 


Ser 


Val 


Val 


Phe 


Asp 


Asp 


He 


Val Thr 


Asn 


Glu 




130 








135 










140 




Glu 






He 


Gin 


Lys 


Arg 


Ala 


Glu 


Gly 


He 


Ser 


Ser 


Tyr 


Tyr Asp 


Leu 


145 








150 










155 








160 


lie 


Glu 


Met 


Thr 


Ser 


Tyr 


Trp 


His 


Leu 


Leu 


Gly 


Glu 


Gly Thr His 


Thr 










165 








170 








175 




Val 


Asn 


Gly 


Lys 


Thr 


Val 


Thr 


Val 


Ser 


Leu 


Arg 


Glu 


Leu Lys 


Lys 


Lys 






180 










185 








190 






Leu 


Tyr 


Leu 


Cys 


Leu 


Met 


Ser 


Val 


Asn 


Ala 


LC a 


Glu 


Ala lie 


Arg 


Phe 




155 








200 










205 






Tyr 


Val 


Ser 


Phe 


Ala 


Cys 


Ser 


Phe 


Ala 


Phe 


Ala 


Glu 


Arg Glu 


Leu 


Met 


210 










215 










220 








Glu 


Gly 


Asn 


Ala 


Lys 


He 


He 


Arg 


Leu 


He 


Ala 


Arg 


Asp Glu Ala 


Leu 
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225 






U 






His 


Leu 


Thr 


Gly Thr Gin 
24 b 


HIS 


Met 


Asp 


Asp 


Pro 


Glu Met Ala 
260 


GlU 


lie 


Tyr 


Asp 


Leu 


Phe Val Gin 


Ala 


Ala 


275 








Tyx 


Leu 


Phe 


Arg Asp Gly 


Ser 


Met 


290 






295 




Cys 


Gin 


Tyr 


Val Glu Tyr 


lie 


Tfir 


305 












Leu 


Asp 


Leu 


Pro Phe Gin 


inr 


Arg 






325 






Thr 


Trp 


Leu 


Val Ser Asp 


Asn 


Val 






340 






Val 


Ser 


Ser 


Tyr Leu Val 


Gly 


Gin 






355 




360 


Asp 


Leu 


Ser 


Asn Phe Gin 


Leu 




370 






375 









235 
j -j 


240 


Leu 


Asn 


Leu 


T i 7A itt Cot* fll v & 1 a 
jjcu MJ.y oci uJ.y *vta 




OCA 




255 


Aia 




LjIU 


t«ys Jjyo uxu vixu ^- y o 


n c c 








Gin 


Gin 


GlU 


Liys A.sp irp >Ua iisp 










He 


Gly 


Lieu 


Asn iiys Asp ij.e lieu 








300 


Asn 


He 


Arg 


Met Gin Ala Val Gly 






315 


320 


Ser 


Asn 


Pro 


He Pro Trp He Asn 




330 




335 


Gin 


Val 


Ala 


Pro Gin Glu Val Glu 


345 






350 


He 


Asp 


Ser 


Glu Val Asp Thr Asp 



365 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
<vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Ala Tyr Thr Thr Phe Ser Gin Asn Lys Asn Asp Gin Leu Lys Glu Pro 

15 10 15 

Met Phe Phe Gly Gin Asn Val Asn Val Ala Arg Tyr Asp Gin Gin Lys 

20 25 30 

Tvr Glu Thr Phe Glu Lys Leu He Glu Lys Gin Leu Ser Phe Phe Trp 

35 40 45 

Arg Pro Glu Glu Val Asp Val Ser Gin Asp Arg He Asp Tyr Ala Ala 

50 55 60 

Leu Pro Glu His Glu Lys His He Phe He Ser Asn Leu Lys Tyr Gin 
65 70 75 80 

Thr Leu Leu Asp Ser He Gin Gly Arg Ser Pro Asn Val Ala Leu Leu 

85 90 95 

Pro Leu Val Ser He Pro Glu Leu Glu Thr Trp He Glu Thr Trp Thr 

100 105 HO 

Phe Ser Glu Thr He His Ser Arg Ser Tyr Thr His He He Arg Asn 

H5 120 125 

He Val Asn Asp Pro Ser He Val Phe Asp Asp He Val Thr Asn Glu 

130 135 140 

Glu He He Lys Arg Ala Gin Asp He Ser Ser Tyr Tyr Asp Asp Leu 
145 150 155 160 

He Arq Asp Ser Gin Leu Tyr Gly Leu Tyr Gly Glu Gly Thr Tyr Thr 

165 170 175 

Val Asp Gly Lys Glu Cys Val Val Thr Leu Arg Ser Leu Lys Lys Gin 

180 I 85 190 

Leu Tvr Leu Cys Leu Met Ser Val Asn Ala Leu Glu Ala He Arg Phe 

ig5 200 205 

Tvr Val Ser Phe Ala Cys Ser Phe Ala Phe Ala Glu Arg Arg Leu Met 

210 215 220 

Glu Gly Asn Ala Lys He II Lys Ph He Ala Arg Asp Glu Ala Leu 
225 230 235 240 

His Leu Thr Gly Thr Gin His He Leu Asn He Met Ala Ala Gly Gin 

245 250 255 

Aso Aso Pro Glu Met Ala Glu He Ala Glu Glu Cys Lys Gin Glu Ala 
F * 260 265 270 
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Asp 


Leu 


IT tiC 


Val 




nla 


Aid 


Glu 


Gin Glu 


Lys 


Ala 


Trp 


Ala 


Asp 






275 










280 








285 






Tvr 


Leu 


Phe 


Lys 


Asp 


Gly 


Ser 


Met 


lie Gly Leu 


Asn Arg Asp 


Tl - 

i± e 


Leu 




290 










295 


















Val 


Gin 


Tvr 


Val 


Glu 


i yr 


lie 


X 111, 


Asn 


He Arg 


Met 


Gin 


Ala 


vax 


uiy 


305 










310 








315 










320 


L£U 


Pro 


Leu 


Pro 


Phe 


Gin 


Thr 




Ser 


Asn Pro 


He 


Pro 


Trp 


lie 


Asn 










325 










330 






335 




Ala 


Trp 


Leu 


Val 
340 


Ser 


Asp 


Asn 


Val 


Gin 
345 


Val Ala 


Pro 


Gin 


Glu 
350 


Val 


Glu 


Val 


Ser 


Ser 
355 


Tyr 


Leu 


Val 


Gly 


Gin 
360 


He 


Asp Ser 


Lys 


Val 
365 


Asp 


Thr 


Asn 


Asp 


Phe 
370 


Asp 


Asp 


Phe 


Ser 


Leu 
375 



















(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 479 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Asp Ala Asp Gly Ala Ser Pro Pro Pro Pro Arg Pro Ala Gly Gly 

15 10 15 

Pro Lys Asn Thr Pro Ala Ala Pro Pro Leu Tyr Ala Thr Gly Arg Leu 

20 25 30 

Ser Gin Ala Gin Leu Met Pro Ser Pro Pro Met Pro Val Pro Pro Ala 

35 40 45 

Ala Leu Phe Asn Arg Leu Leu Asp Asp Leu Gly Phe Ser Ala Gly Pro 

50 55 60 

Ala Leu Cys Thr Met Leu Asp Thr Trp Asn Glu Asp Leu Phe Ser Ala 
65 70 75 80 

Leu Pro Thr Asn Ala Asp Leu Tvr Arg Glu Cys Lys Phe Leu Ser Thr 

85 90 95 

Leu Pro Ser Asp Val Val Glu Trp Gly Asp Ala Tyr Val Pro Glu Arg 

100 105 110 

Ala Gin He Asp He Arg Ala His Gly Asp Val Ala Phe Pro Thr Leu 

115 120 125 

Pro Ala Thr Arg Asp Gly Leu Gly Leu Tyr Tyr Glu Ala Leu Ser Arg 

130 135 140 

Phe Phe His Ala Glu Leu Arg Ala Arg Glu Glu Ser Tyr Arg Thr Val 
145 150 155 160 

Leu Ala Asn Phe Cys Ser Ala Leu Tyr Arg Tyr Leu Arg Ala Ser Val 

165 " 170 175 

Arg Gin Leu His Arg Gin Ala His Met Arg Gly Arg Asp Arg Asp Leu 

180 185 190 

Gly Glu Met Leu Arg Ala Thr He Ala Asp Arg Tyr Tyr Arg Glu Thr 

195 200 205 

Ala Arg Leu Ala Arg Val Leu Phe Leu His Leu Tyr Leu Phe Leu Thr 

210 215 220 

Arg Glu He Leu Trp Ala Ala Tyr Ala Glu Gin Met Met Arg Pro Asp 
225 230 235 240 

Leu Phe Asp Cys Leu Cys Cys Asp Leu Glu Ser Trp Arg Gin Leu Ala 

245 250 255 

Gly Leu Phe Gin Pro Phe Met Phe Val Asn Gly Ala Leu Thr Val Arg 

260 265 270 

Gly Val Pro He Glu Ala Arg Arg Leu Arg Glu Leu Asn His He Arg 

275 280 285 

Glu His Leu Asn Leu Pro Leu Val Arg Ser Ala Ala Thr Glu Glu Pro 

290 295 300 

Gly Ala Pro Leu Thr Thr Pro Pro Thr Leu His Gly Asn Gin Ala Arg 
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305 










310 










315 






320 


Ala 


Ser 


Gly Tyr 


Phe 


Met 


Val 


Leu 


lie 


Arg 


Ala Lys Leu 


Asp 


Ser 


Tyr 










325 










330 






335 




Ser 


Ser 


Phe 


Thr 


Thr 


Ser 


Pro 


Ser 


Glu 


Ala 


Val Met Arg 


Glu 


His 


Ala 








340 










345 






350 






Tyr 


Ser 


Arg Ala Arg 


Thr 


Lys 


Asn 


Asn 


Tyr 


Glv Ser Thr 


lie 


Glu 


Gly 




355 










360 






365 








Leu 


Leu 


Asp 


Leu 


Pro 


Asp 


Asp 


Asp 


Ala 


Pro 


Glu Glu Ala 


Gly Leu 


Ala 




370 










375 








380 








Ala 


Pro 


Arg 


Leu 


Ser 


Phe 


Leu 


Pro 


Ala 


Gly 


His Thr Arg 


Arg 


Leu 


Ser 


385 










390 










395 






400 


Thr 


Ala 


Pro 


Pro 


Thr 


Asp 


Val 


Ser 


Leu 


Gly 


Asp Glu Leu 


His 


Leu 


Asp 










405 










410 






415 




Gly 


Glu 


Asp 


Val 


Ala 


Met 


Ala 


His 


Ala 


Asp 


Ala Leu Asp 


Asp 


Phe 


Asp 




420 










425 






430 






Leu 


Asp 


Met 


Leu Gly 


Asp 


Gly 


Asp 


Ser 


Pro 


Gly Pro Gly 


Phe 


Thr 


Pro 




435 










440 






445 








His 


Asp 


Ser 


Ala 


Pro 


Tyr 


Gly 


Ala 


Leu 


Asp 


Met Ala Asp 


Phe 


Glu 


Phe 




450 










455 








460 








Glu 


Gin 


Met 


Phe 


Thr 


Asp 


Ala 


Leu 


Gly 


lie 


Asp Glu Tyr 


Gly Gly 





465 470 475 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Met 


Asp 


Leu 


Leu 


Val 


Asp 


Asp 


Leu 


Phe 


Ala 


Asp 


Arg Asp 


Gly Val 


Ser 


1 






5 










10 






15 




Pro 


Pro 


Pro 


Pro 


Arg 


Pro 


Ala 


Gly 


Gly 


Pro 


Lys 


Asn Thr 


Pro Ala 


Ala 








20 










25 








30 




Pro 


Pro 


Leu 


Tyr 


Ala 


Thr 


Gly 


Arg 


Leu 


Ser 


Gin 


Ala Gin 


Leu Met 


Pro 






35 








40 








45 






Ser 


Pro 


Pro 


Met 


Pro 


Val 


Pro 


Pro 


Ala 


Ala 


Leu 


Phe Asn 


Arg Leu 


Leu 




50 










55 










60 






Asp 


Asp 


Leu 


Gly 


Phe 


Ser 


Ala 


Gly 


Pro 


Ala 


Leu 


Cys Thr 


Met Leu 


Asp 


65 








70 










75 






80 


Thr 


Trp 


Asn 


Glu 


Asp 


Leu 


Phe 


Ser 


Gly 


Phe 


Pro 


Thr Asn 


Ala Asp 


Met 








85 










90 






95 




Tyr 


Arg 


Glu 


Cys 


Lys 


Phe 


Leu 


Ser 


Thr 


Leu 


Pro 


Ser Asp 


Val He 


Asp 




100 










105 








110 




Trp 


Gly 


Asp 


Ala 


His 


Val 


Pro 


Glu 


Arg 


Ser 


Pro 


He Asp 


He Arg 


Ala 


115 










120 








125 






His 


Gly 


Asp 


Val 


Ala 


Phe 


Pro 


Thr 


Leu 


Pro 


Ala 


Thr Arg 


Asp Glu 


Leu 




130 








135 










140 






Pro 


Ser 


Tyr 


Tyr 


Glu 


Ala 


Met 


Ala 


Gin 


Phe 


Phe 


Arg Gly 


Glu Leu 


Arg 


145 






150 










155 






160 


Ala 


Arg 


Glu 


Glu 


Ser 


Tyr 


Arg 


Thr 


Val 


Leu 


Ala 


Asn Phe 


Cys Ser 


Ala 








165 










170 






175 




Leu 


Tyr 


Arg 


Tyr 


Leu 


Arg 


Ala 


Ser 


Val 


Arg 


Gin 


Leu His 


Arg Gin 


Ala 




180 










185 








190 




His 


Met 


Arg 


Gly 


Arg 


Asn 


Arg 


Asp 


Leu 


Arg 


Glu 


Met Leu 


Arg Thr 


Thr 






195 








200 








205 






He 


Ala 


Asp 


Arg 


Tyr 


Tyr 


Arg 


Glu 


Thr 


Ala 


Arg 


Leu Ala 


Arg Val 


Leu 




210 






215 










220 






Phe 


Leu 


His 


Leu 


Tyr 


Leu 


Phe 


Leu 


Ser 


Arg 


Glu 


He Leu 


Trp Ala 


Ala 


225 








230 










235 






240 


Tyr 


Ala 


Glu 


Gin 


Met 


Met 


Arg 


Pro 


Asp 


Leu 


Phe 


Asp Gly 


Leu Cys 


Cys 








245 










250 






255 
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Asp 


Leu 


Glu 


Ser 


Trp 


Arg 


lain 


Leu 


Ala 


Cys 


t.oii PVio Ol n Pro 

IjCU SrlltS VjXII * A w 








260 










265 




270 




Phe 


lie 


Asn 


vsiy 


Ser 


Leu 


Thr 


V ct X 


Arg 


uiy 


1 prn Val Glu 


Ala Arg 






275 














285 




Arg 


Leu 


Arg 


blU 


Leu 


Asn 


X11S 


Tip 


Arg 


ox u 


UH c T.oii len T.fMi 
IlXo XjC Li noil ucu 


Pro Leu 


















300 




Val 


Arg 


Ser 


Ala 


Kin 
Aid 


Til a 

Aia 






Pro 




Ala rlw bcu 


Thr Pro 


305 








"inn 










TIC 


320 


Pro 


Val 


Leu 


Gin 


Gly 


Asn 


Cain 


Aia 


Arg 


Ser 

J J w 




Mat* T.f»u 

335 


Leu 


lie 


Arg 


Ala 


Lys 


Leu 


Asp 


Ser 


Tyr 


Coy* 


Cp r Ual aia ThlT 
OCI vox J. lii. 


Ser Glu 






340 










J4 j 




J J w 




Gly 


Glu 


Ser 


Val 


Met 


Arg 


G1U 


rllS 


Aia 


Tyr 


bet Arg Vaiy Ary 






"3 C C 










JDU 






J O 3 




Asn 


Asn 


Tyr 


Gly 


ser 


inr 


lie 


ii 


uiy 


Leu 


T on ten T.on Dt*o 
LcU Asp licU riu 


fien TV 
nay no^ 




370 








J / D 








380 




Asp 


Asp 


Ala 


Pro 


Ala 


GlU 


Ala 


pi,. 

tiiy 


Leu 


vai 


Aia fro Arg wee 


ser prie 


385 








390 












** 11 V 


Leu 


Ser 


Ala 


Gly 


Gin 


Arg 


Pro 


Arg 


Arg 


Leu 


ser Tnr inr Aia 


D-yA Tl a 

fro lie 








405 










4 jLU 






Thr 


Asp 


Val 


Ser 


Leu 


Gly 


Asp 


Glu 


Leu 


Arg 


Leu Asp uiy oiu 


fin \"a 1 
ulU Veil 






420 


















Asp 


Met 


Thr 


Pro 


Ala 


Asp 


Ala 


Leu 


Asp 


Asp 


rile ASp LcU ulu 


Mo^ T on 
rtcC LcU 




435 










440 






445 




Gly 


Asp 


Val 


Glu 


Ser 


Pro 


Ser 


Pro 


Gly 


Met 


Thr His Asp Pro 


Val Ser 


450 










455 








460 




Tyr 


Gly 


Ala 


Leu 


Asp 


Val 


Asp 


Asp 


Phe 


Glu 


Phe Glu Gin Met 


Phe Thr 


465 








470 










475 


480 


Asp 


Ala 


Met 


Gly 


lie 


Asp 


Asp 


Phe 


Gly 


Gly 












485 










490 







(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 





(xi) SEQUENCE 


DESCRIPTION: 


: SEQ ID 


NO: 41: 






Met 


Ser 


Gly 


Arg 


He 


Lys 


Thr 


Ala 


Gly 


Arg 


Ala 


Leu Ala Ser 


Gin 


Cys 


l 




5 










10 






15 




Gly 


Gly 


Ala 


Ala 


Ala 


Ala 


Thr 


Met 


Asp 


Pro 


Tyr 


Asp Ala He 


Glu 


Ala 




20 










25 






30 






Phe 


Asp 


Asp 


Ser 


Leu 


Leu 


Gly 


Ser 


Pro 


Leu 


Ala 


Ala Gly Pro 


Leu 


Tyr 




35 










40 








45 






Asp 


Gly 


Pro 


Ser 


Pro 


Ala 


Arg 


Phe 


Ala 


Leu 


Pro 


Pro Pro Arg 


Pro 


Ala 


50 










55 










60 






Pro 


Leu 


Ala 


Ala 


Leu 


Leu 


GlU 


Arg 


Met 


Gin 


Ala 


Glu Leu Gly 


Phe 


Pro 


65 










70 










75 






80 


Asp 


Gly 


Pro 


Ala 


Leu 


Leu 


Arg 


Ala 


Met 


Glu 


Arg 


Trp Asn Glu 


Asp 


Leu 






85 










90 






95 




Phe 


Ser 


Cys 


Leu 


Pro 


Thr 


Asn 


Ala 


Asp 


Leu 


Tyr 


Ala Asp Ala 


Ala 


Leu 






100 










105 






110 






Leu 


Ser 


Ala 


Asp 


Ala 


Asp 


Ala 


Val 


Val 


Gly 


Ala 


Met Tyr Leu 


Ala 


Val 






115 








120 








12S 






Pro 


Gly 


Asp 


Ala 


Glu 


Arg 


Leu 


Asp 


Leu 


Asn 


Ala 


His Ala Asn 


Gin 


Pro 




130 








135 










140 






Leu 


Pro 


Ala 


Pro 


Pro 


Ala 


Ser 


Glu 


Glu 


Gly 


Leu 


Pro Glu Tyr 


Val 


Ala 


145 










ISO 










155 






160 


Gly 


Val 


Gin 


Ala 


His 


Phe 


Leu 


Ala 


Glu 


Leu 


Arg 


Ala Arg Glu 


Glu 


Arg 








165 










170 






175 




Tyr 


Ala 


Gly 


Leu 


Phe 


Leu 


Gly 


Tyr 


Cys 


Arg 


Ala 


Leu Leu Gin 


His 


Leu 
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180 185 190 

Arg Ala Thr Ala Ala Arg Gly Arg Gly Ala Ala Gly Ala Gly Ala Gin 

195 200 205 

Ala Asp Arg Leu Arg Gin Leu Val Ala Ala Arg Tyr Tvr Arg Glu Ala 

210 215 220 

Ser Arg Leu Ala Arg Leu Ala Phe Ala His Met Tyr Val Ala Thr Ala 
225 230 235 240 

Arg Glu Val Ser Trp Arg Leu His Ser Gin Gin Ser Gin Ala Gin Gly 

245 250 255 

Val Phe Val Ser Leu Tyr Tyr Ala Trp Pro Gin Arg Arg Gin Phe Thr 

260 265 270 

Cys Leu Phe His Pro Val Leu Phe Asn His Gly Val Val Ala Leu Glu 

275 280 285 

Asp Gly Phe Leu Asp Ala Ala Glu Leu Arg Arg Leu Asn Tvr Arg Arg 

290 295 300 

Arg Glu Leu Gly Leu Pro Leu Val Arg Ala Gly Leu Val Glu Val Glu 
305 310 315 320 

Val Gly Pro Leu Val Glu Glu Pro Pro Phe Ser Gly Ser Leu Pro Arg 

325 330 335 

Ala Leu Gly Phe Leu Asn Tyr Gin Val Arg Ala Lys Met Glv Ala Pro 

340 345 350 

Ala Glu Ala Gly Gly Gly Trp Arg Arg Ser Gly Ser Thr Arg Thr Arg 

355 360 365 

Gly Arg Ala Ala Arg Ser Thr Thr Gly Arg Leu Gin Arg Pro Cvs Cys 

370 375 380 

Gly Pro Arg Arg Arg Ala Lys Cys Cys Arg Ala Thr Pro Arg Gin Arg 
385 390 395 400 

Leu Arg Ala Arg Gly Glu Pro Arg His Thr Ser Gly Ser Gly Ala Phe 

405 410 415 

Ser Gin Gly Arg Arg Pro Gly Arg Val Cys Arg Leu Gly Tro Ala Cys 

420 425 430 

Lys Ala Arg Ser Gly Pro Ala Arg Gly Gly Pro Gly Pro Ser Pro Val 

435 440 445 

Arg Ser Gly Leu Gly Leu Ser Arg Ala Arg Gly Ser Pro Gly Pro Gly 

450 455 460 

Pro Ala Cys Gly Gly Pro Ser Arg Ala Arg Gly Gly Arg Arg Arg Ala 
465 470 475 480 

Ser Pro Ala Asn Pro Phe Gly Gly Thr Tyr Asp Ala Leu Leu Gly Asp 

485 490 495 

Arg Leu Asn Gin Leu Leu Asp Phe 
500 



(2) INFORMATION FOR SEQ ID NO; 42; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



Met 


Glu 


Cys 


Asn 


Leu 


Gly 


Thr 


Glu 


His 


Pro 


Ser Thr Asp Thr Trp Asn 


1 








5 










10 


15 


Arg 


Ser 


Lys 


Thr 


Glu 


Gin 


Ala 


Val 


Val 


Asp 


Ala Phe Asp Glu Ser Leu 


*> 






20 










25 




30 


Phe 


Gly 


Asp 
35 


Val 


Ala 


Ser 


Asp 


He 
40 


Gly 


Phe 


Glu Thr Ser Leu Tyr Ser 
45 


His 


Ala 


Val 


Lys 


Thr 


Ala 


Pro 


Ser 


Pro 


Pro 


Trp Val Ala Ser Pro Lys 


♦ 


50 








55 








60 


He 


Leu 


Tyr 


Gin 


Gin 


Leu 


He 


Arg 


Asp 


Leu 


Asp Phe Ser Glu Gly Pro 


65 










70 










75 " 80 


Arg 


Leu 


Leu 


Ser 


Cys 
85 


Leu 


Glu 


Thr 


Trp 


Asn 
90 


Glu Asp Leu Phe Ser Cys 
95 
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Irxie 


Pro 


Tip 

lie 


As 21 


fSl 11 


7\ en 






Ser 




J'lC u 11c u V a. X 


T .pi i Cor Pro 
xicu ocx rxu 








100 










105 






110 


ASp 


Zr £. (J 


ASO 


Asp 


Val 


He 


Ser 


Thr 


Val 


Ser 


Thr I ,vq A cm 


His Val Glu 






115 










120 






125 




Met- 
ric u 


Phf» 


As 11 


Leu 


Thr 


Thr 


Arcr 


0 — / 


Ser 


Val 


Arcr L^u Pt*o 


Cpr Pro Pro 




130 










135 








140 




Jjy o 


Gl n 


XT X U 


X XIX 


Glv 

V3 X y 


Leu 


Pro 


Ala 


TvT 
xyx 


Val 


Gin Glu Val 
uxii vjx u vax 


Gl n A^n S^r 

wXXX /wU WCX 


XTB J 










150 










155 


160 


r*iic 


1 J IX 


Val 


Glu 


Leu 


A rir 


Ala 




Glu 


Glu 


J\l a Tvr Th r 
nx 0 x y x x xxx 


T.vc T.Pii T.pii 
uyo xjc u ijcu 










165 










170 




175 


Va 1 
vai 


Thr 


Tvr 

/ 




Lys 


OCX 


He 


He 


Arcr 


Tvr 
x yx 


T.pii Gin Glv 
jjcu V3xii wxy 


Thr Ala T.v^ 

X 1XX /VA CI. XJjrO 








180 










185 






190 


7X T*CT 

Arg 


Thr 


X XIX] 


He 


Glv 
oay 


Leu 


Asn 


He 


Gin 


Asn 


Pro A en Gl n 


Lve IV 1 a Tvr 
Ljrs /vx ct xyx 






195 










200 






205 




X XXX 


Gin 

will 


xjc ix 


Jiro 

**xy 


Gin 


Ser 


He 


Leu 


Leu 


Arcr 


Tvr Tvr Aro 

x y x x y x ^%x y 


Glu Val Ala 




^ X w 










215 








220 




Car 


Leu 


i"i*l d 


a rrr 

Krg 


XJC IX 


T .<3>1 1 
XjCIX 


X V X 


Leu 


His 


T.fai 1 
XiC U 


Tvr T.011 Thr 
xyx xjcix xxxx 


Va 1 Thr Aro 
vax x xxx /u y 


**** ^ 










230 










235 


240 


uiu 




Cor 


Trn 

irp 


Arg 


T .01 1 


Tl fy 

iyrr 


Ala 

AXa 


c^r 

OCX 


will 


ocx nia fix 49 


Orn A cn \7al 
zr x nop Val 










5d5 










250 




4. ^ j 


file 


TV 1 a 

Aia 


TV 1 a 

Aia 


Leu 


Lys 


DVi a 

file 


Th r 

X XIX 


_ 

Trp 


Thr 
1 ILL 


Gl 11 
ulu 


Hiy Axy ulu 


DKn Thr fSre 
rile xxTx V^yS 


















X D 3 






4 f \J 


Ala 


rlic 


XIX S 


Pro 


vox 


Leu 




ash 


His 


gi v 

yxy 


TT p 1 T .mi 
lie vax Xjcix 


xjcu wxu ox y 






575 










280 






285 




Lys 


Pro 


Leu 


Thr 


Hid 


ser 


nl a 


Leu 


_ 

Arg 


ul 11 


T 1 0 J\ en T\/r 
lie iibu 1 VI 


TV ttt IV yn 7A rrr 
>vL^ i"l>X ^ /i-X y 




290 










^ -7 3 












ulu 


Leu 


pi *> 
taiy 


Leu 


Pro 


Leu 


Val 


Arg 


uys 


\jxy 


L6U VoX ulu 


ulu >isxx liys 


*} n c 










•5 1 n 

J1U 










Jl3 


J* V 


Ser 


Pro 


Leu 


val 


Gin 


Gin 


Pro 


Ser 


QLa 

rnc 


Ser 


Val nlS LeU 


Pro Arg Ser 




















Tin 




7 7 c 


vai 


Gly 


Piie 


Leu 


Tiir 


nis 


IT ^ — 

nlS 


T 1 0 


Lys 


Arg 


Lys Leu Asp 


Ala 1 yx ilia 








340 










345 






350 


Val 


Lys 


His 


Pro 


Gin 


Glu 


Pro 


Arg 


His 


Val 


Ara Ala Asp 


His Pro Tyr 




355 










360 






365 




Ala 


Lys 


Val 


Val 


Glu 


Asn 


Arg 


Asn 


Tyr 


Gly 


Ser Ser He 


Glu Ala Met 




370 










375 








380 




He 


Leu 


Ala 


Pro 


Pro 


Ser 


Pro 


Ser 


Glu 


He 


Leu Pro Gly 


Asp Pro Pro 


385 










390 










395 


400 


Arg 


Pro 


Pro 


Thr 


Cys 


Gly 


Phe 


Leu 


Thr 


Arg 







405 410 



(2) INFORMATION FOR SEQ ID NO: 43: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 



Met 


Ala 


Ala 


Asn 


He 


Ala 


Met 


Phe 


Ala 


Asp 


He 


Glu Asp 


Tyr Asp Asp 


1 








5 










10 






15 


Thr 


Arg 


Ser 


Cys 


Glu 


Tyr 


Gly 


Tyr 


Gly 


Thr 


Cys 


Glu Leu 


Met Asp Val 






20 










25 








30 


Asp 


Gly 


Val 


Val 


Ala 


Ser 


Phe 


Asp 


Glu 


Gly 


Met 


Leu Ser 


Ala Ser Glu 


35 










40 








45 




Ser 


He 


Tyr 


Ser 


Ser 


Pro 


Ala 


Gin 


Lys 


Arg 


Leu 


Ala Leu 


Pro Pro Pro 




50 








55 










60 




Lys 


Ala 


Thr 


Ser 


Pro 


Thr 


Ala 


Leu 


Tyr 


Gin 


Arg 


Leu Gin 


Ala Glu Leu 


65 










70 










75 




80 


Gly 


Phe 


Pro 


Glu 


Gly 


Gin 


Ala 


Met 


Leu 


Phe 


Ala 


Met Glu 


Lvs Trp Asn 








85 










90 






95 


Glu 


Asp 


Met 


Phe 


Ser 


Ala 


He 


Pro 


Val 


His 


Val 


Asp Leu 


Tyr Thr Glu 
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100 105 110 



He 


Ala 


Leu 
115 


Leu 


Ser 


Thr 


Ser 


Val 
120 


Asn 


Glu 


Val 


Val 


Lys 
125 


Ala 


Gly 


Leu 


Asp 


Ser, 


, Leu 


Pro 


He 


Pro 


Thr 


Asn 


Tyr 


He 


Pro 


Glu 


Val 


Asp 


Leu 


Asn 


130 










135 










140 










Ala 


His 


Gly 


Ser 


Glu 


Pro 


Phe 


Pro 


Glu 


Val 


Pro 


Ala 


Leu 


Glu 


Asp 


Glu 


145 








150 










155 










160 


Leu 


Glu 


Thr 


Tyr 


Val 
165 


lie 


Ser 


TV 1 — 

Ala 


Gin 


Arg 
170 


Pne 


Tyr 


Leu 


Ser 


(jIU 
175 


T Alt 

Leu 


Arg 


Ala 


Arg 


Glu 


Glu 


His 


Tyr 


Ser 


Arg 


Leu 


Leu 


Arg Gly Tyr 


Cys 


val 




180 










185 










i on 

17V 






Ala 


Leu 


Leu 
195 


His 


Tyr 

/ 


Leu 


Tyr 


Gly 

Z DO 


Ser 


Ala 


Lys 


Arg 


Gin 
205 


Leu 


Arg 


Gly 


A J. a 


Gly 


Ser 


Asp 


OCX 


nla 


T .01 i 




Hi <5 






Lvs 


Gin 


Val 


Val 






210 








4. ±. -J 










220 










ho Lv 


Arg 


Tyr 


j.yx 




Glu 


Thr 


Ala 


Asn 


Leu 


Ala 


Arg 


Leu 


Leu 


Tvr 


Leu 


one 










^ o \j 










^ *j 










240 


XT'* c 


Leu 


Tyr 


lie 


Ser 


Val 


Thr 




Glu 


Val 


Ser 


Trp Arg 


Leu 


His 


Ala 
































Ser 


Gin 


Val 


vai 
260 


Asn 


bin 


uiy 


lie 


jrne 
265 


vai 


Ser 


Leu 


His 


Tyr 
270 


1 IIL 


lrp 


Pro 


Gin Arg 


Arg 


Lys 


Phe 


Glu 


Cys 


Leu 


Phe 


His 


Pro 


Val 


Leu 


Phe 


Asn 






275 










280 










285 








His 


Gly Val 


Val 


He 


Leu 


Glu 


Asn 


Asp 


Pro 


Leu 


Glu 


Phe 


Asn 


Asp 


Leu 




290 










295 










300 










Gin 


Arg 


He 


Asn 


Tyr 


Arg 


Arg 


Arg 


Glu 


Leu 


Gly 


Leu 


Pro 


Leu 


He 


Arg 


305 








310 










315 










320 


Ala 


Gly 


Leu 


He 


GlU 


Glu 


Glu 


Asn 


Leu 


Pro 


Leu 


Glu 


Ser 


Glu 


Pro 


Thr 








325 










330 










335 




Phe 


Ser 


Gly 


Lys 


Leu 


Pro 


Arg 


Thr 


He 


Gly 


Phe 


Leu 


Thr 


His 


Gin 


He 






340 










345 










350 






Arg 


Thr 


Lys 


Met 


Glu 


Ala 


Tyr 


Ser 


Asn 


Ala 


His 


Pro 


Ser 


Thr 


Pro 


Leu 




355 










360 










365 








Phe 


Pro 


Leu 


Ala 


Glu 


His 


Ser 


Tyr 


Ser 


Lys 


Arg 


He Asp Gly 


Arg 


Leu 




370 










375 










380 










Ser 


Tyr Gly 


Thr 


Thr 


Ala 


GlU 


Ala 


Met 


Met 


Asp 


Pro 


Pro 


Ser 


Pro 


Ser 


385 










390 










395 










400 


Ala 


Val 


Leu 


Pro 


Gly 


Asp 


Pro 


Val 


Pro 


Pro 


Leu 


Thr Val Gly 


He 


Arg 










405 








410 










415 




Gin 


Thr 


Ala 


Glu 
420 


Thr 


Leu 


Ala 


Leu 


Pro 
425 


Ser 


Asn 


Leu 


Thr 


Leu 
430 


Gin 


Ser 


Met 


Glu 


Thr 


Asp 


Val 


Leu 


Asp 


Tyr 


Ser 


Ser 


He 


Ser Gly Asp 


Glu 


Leu 






435 










440 










445 








Asn 


Gin 
450 


Met 


Phe 


Asp 


He 























(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 
<iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ IDNO:44: 

Met Cys Leu Leu His He Ser Leu Pro Tyr Leu Ser Cys Ala Leu Leu 

1 5 10 15 

Pro Gly Trp Tyr Phe Asp Ala Arg Pro Ala Al * Ser He Val Met Phe 

20 25 * 30 

Ala Ala Ala Glu Glu Asn Asp Asp Pro Tyr Pro Gly Lys Ser Gly Tyr 

35 40 45 

Asn Asp Thr Cys Glu Leu Met Asp Met Asp Gly Ala Val Ala Ser Phe 
50 55 60 
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75 80 
Lys Ala Ala Ser Pro Gly Ala 
90 95 
Gly Phe Pro Glu Gly Gin Thr 
110 

Glu Ast» Met Phe Ser Ala Leu 
125 

He Ala Leu Leu Ser Thr Ser 
140 

Asp Ser Leu Pro Thr Pro Ser 
155 160 
Ala His Gly Asp Glu Pro Phe 
170 175 
Leu Glu He Tyr Val He Ser . 

190 

Arg Thr Arg Glu Glu His Tyr 
205 

Ala Leu Leu His Tyr Leu Tyr 
220 

Ser Gly Ser Asp Ala Ser Leu 
235 240 
Asp Arg Tyr Tyr Arg Glu Ala 
250 255 
His Leu Tyr Val Ser Val Thr 
270 

Ser Gin Val He Asn Gin Gly 
285 

Ala Gin Arg Arg Lys Phe Glu 
300 

His Gly Val Val He Leu Glu 
315 320 
Gin Arg He Asn Tyr Arg Arg 
330 335 
Ala Gly Leu He Glu Glu Glu 
350 

Phe Ser Gly Lys Leu Pro Arg 
365 

Arg Thr Lys Met Glu Ala Tyr 
380 

Phe Pro Leu Ala Glu His Ser 
395 400 
Ser Tyr Gly Thr Thr Thr Glu 
410 415 
Ala Val Leu Pro Gly Asp Pro 
430 

Gin Thr Ala Ala Thr Leu Ala 
445 

Met Glu Thr Asp Gly Leu Asp 
460 

Asn Gin Met Phe Asp He 
475 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 52 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
Met Gly Pro Leu Met Val Leu Phe Cys Leu Leu Phe Leu Tyr Pro Gly 

SUBSTITUTE SHEET (MLE 26) 



Asp 


Glu Gly 


Met 


Leu 


Ser 


Ala 


He 


Glu 


65 










70 








Lys 


Lys 


Arg 


Leu 


Ala 
85 


Leu 


Pro 


Pro 


Pro 


Leu 


Tyr 


Gin 


Arg 


Leu 


Gin 


Gly 


Glu 


Leu 






100 










105 


Leu 


Leu 


Ser 


Ala 


Met 


Glu 


Lys 


Trp 


Asn 






115 










120 




Pro 


Gly His 


Val 


Asp Leu 


Tyr 


Thr 


Glu 




130 










135 






Val 


Asp 


Glu 


Val 


Val Arg 


Ala 


Gly 


Leu 


145 








150 








His 


Tyr 


Ser 


Pro 


Glu 


Val 


Asp 


Leu 


Asn 








165 










Pro 


Glu 


Val 


Pro 


Ala 


Leu 


Glu 


Asp 


Asp 








180 










185 


Ala 


Gin Arg 


Phe 


Tyr 


Leu 


Ser 


Glu 


Leu 






195 










200 




Ala 


Arg 


Leu 


Leu 


Arg Gly 


Tyr 


Cys 


Val 




210 










215 






Gly 


Ser 


Ala 


Lys 


Arg 


Gin 


Leu 


Arg 


Gly 


225 










230 








Met 


His 


Lys 


Phe 


Lys 


Gin 


Val 


Val 


Arg 








245 










Ala 


Asn 


Leu 


Ala 


Arg 


Leu 


Leu 


Tyr 


Leu 








260 










265 


Arg 


Glu 


Val 


Ser 


Trp Arg 


Leu 


His 


Ala 




275 










280 




Val 


Phe 


Val 


Ser 


Leu 


His 


Tyr 


Phe 


Trp 




290 










295 






Cys 


Leu 


Phe 


His 


Pro 


Val 


Leu 


Phe 


Asn 


305 










310 








Asn 


Asp 


Pro 


Leu 


Glu 


Phe 


His 


Asp 


Leu 








325 










ArQ 


Glu 


Leu 


Gly 


Leu 


Pro 


Leu 


lie 


Arg 






340 










345 


Asn 


Ser 


Pro 


Leu 


Glu 


Ala 


Glu 


Pro 


Leu 






355 










360 




Thr 


He Gly 


Phe 


Leu 


Thr 


His 


Gin 


He 




370 










375 






Ser 


Asp 


Ala 


His 


Pro 


Ala 


Thr 


Pro 


Leu 


385 








390 








Tyr 


Ser 


Lys 


Arg 


He 


Gly 


Gly 


Arg 


Leu 








405 










Ala 


Met 


Met 


Asp 


Pro 


Pro 


Ser 


Pro 


Ser 








420 










425 


Val 


Pro 


Pro 


Leu 


Thr 


Val 


Gly 


Val 


Arg 






435 










440 




lie 


Pro 


Ser 


Asn 


Leu 


Thr 


Leu 


Gin 


Ser 




450 










455 


Glu 




Tyr 


Ser 


Ser 


Met 


Thr Gly 


Asp 


Leu 


465 










470 
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15 10 15 

Leu Ala Asp Ser Ala Pro Ser Cys Pro Gin Asn Val Asn He Ser Gly 

20 25 30 

Gly Thr Phe Thr Leu Ser His Gly Trp Ala Pro Gly Ser Leu Leu Thr 

35 40 45 

Tyr Ser Cys Pro Gin Gly Leu Tyr Pro Ser Pro Ala Ser Arg Leu Cys 

50 55 60 

Lys Ser Ser Gly Gin Trp Gin Thr Pro Gly Ala Thr Arg Ser Leu Ser 
€5 70 75 80 

Lys Ala Val Cys Lys Pro Val Arg Cys Pro Ala Pro Val Ser Phe Glu 

85 90 95 

Asn Gly He Tyr Thr Pro Arg Leu Gly Ser Tyr Pro Val Gly Gly Asn 

100 105 110 

Val Ser Phe Glu Cys Glu Asp Gly Phe He Leu Arg Gly Ser Pro Val 

115 120 125 

Arg Gin Cys Arg Pro Asn Gly Met Trp Asp Gly Glu Thr Ala Val Cys 

130 135 140 

Asp Asn Gly Ala Gly His Cys Pro Asn Pro Gly He Ser Leu Gly Ala 
145 150 155 160 

Val Arg Thr Gly Phe Arg Phe Gly His Gly Asp Lys Val Arg Tyr Arg 

165 170 175 

Cys Ser Ser Asn Leu Val Leu Thr Gly Ser Ser Glu Arg Glu Cys Gin 

180 185 190 

Gly Asn Gly Val Trp Ser Gly Thr Glu Pro He Cys Arg Gin Pro Tyr 

195 200 205 

Ser Tyr Asp Phe Pro Glu Asp Val Ala Pro Ala Leu Gly Thr Ser Phe 

210 215 220 

Ser His Met Leu Gly Ala Thr Asn Pro Thr Gin Lys Thr Lys Glu Ser 
225 230 235 240 

Leu Gly Arg Lys He Gin He Gin Arg Ser Gly His Leu Asn Leu Tyr 

245 250 255 

Leu Leu Leu Asp Cys Ser Gin Ser Val Ser Glu Asn Asp Phe Leu He 

260 265 270 

Phe Lys Glu Ser Ala Ser Leu Met Val Asp Arg He Phe Ser Phe Glu 

27S 280 285 

He Asn Val Ser Val Ala He He Thr Phe Ala Ser Glu Pro Lys Val 

290 295 300 

Leu Met Ser Val Leu Asn Asp Asn Ser Arg Asp Met Thr Glu Val He 
305 310 315 320 

Ser Ser Leu Glu Asn Ala Asn Tyr Lys Asp His Glu Asn Gly Thr Gly 

325 330 335 

Thr Asn Thr Tyr Ala Ala Leu Asn Ser Val Tyr Leu Met Met Asn Asn 

340 345 350 

Gin Met Arg Leu Leu Gly Met Glu Thr Met Ala Trp Gin Glu He Arg 

355 360 365 

His Ala He He Leu Leu Thr Asp Gly Lys Ser Asn Met Gly Gly Ser 

370 375 380 

Pro Lys Thr Ala Val Asp His He Arg Glu He Leu Asn He Asn Gin 
385 390 395 400 

Lys Arg Asn Asp Tyr Leu Asp He Tyr Ala He Gly Val Gly Lys Leu 

405 410 415 

Asp Val Asp Trp Arg Glu Leu Asn Glu Leu Gly Ser Lys Lys Asp Gly 

420 425 430 

Glu Arg His Ala Phe He Leu Gin Asp Thr Lys Ala Leu His Gin Val 

435 440 445 

Phe Glu His Met Leu Asp Val Ser Lys Leu Thr Asp Thr He Cys Gly 

450 455 460 

Val Gly Asn Met Ser Ala Asn Ala Ser Asp Gin Glu Arg Thr Pro Trp 
465 470 475 480 

His Val Thr He Lys Pro Lys Ser Gin Glu Thr Cys Arg Gly Ala Leu 

485 490 495 

He Ser Asp Gin Trp Val Leu Thr Ala Ala His Cys Phe Arg Asp Gly 

500 505 510 

Asn Asp His Ser Leu Trp Arg Val Asn Val Gly Asp Pro Lys Ser Gin 

515 520 525 

Trp Glv Lys Glu Leu Leu He Glu Lvs Ala Val He Ser Pro Gly Phe 

530 535 540 

Asp Val Phe Ala Lys Lvs Asn Gin Gly He Leu Glu Phe Tyr Gly Asp 
545 550 555 560 

Asp He Ala Leu Leu Lys Leu Ala Gin Lys Val Lys Met Ser Thr His 
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S65 570 575 



Ala 


Arg 


Pro 


lie 


Cys 


Leu 


Pro 


Cys 


Thr 


Met 


Glu 


Ala 


Asn Leu Ala 


Leu 






580 










585 








590 




/ti y 




Pro 


Gin 


Glv 


Ser 


Thr 


Cys 


Ara 


Asp 


His 


Glu 


Asn Glu Leu 


Leu 




595 










600 










605 




Asn 


Lys 


Gin 


Ser 


Val 


Pro 


Ala 
615 


His 


Phe 


Val 


Ala 


Leu 
620 


Asn Gly Ser 


Lys 


Leu 


Asn 


He 


Asn 


Leu 


Lvs 


Met 


Gly 


Val 


Glu 


Trp 


Thr 


Ser Cys Ala 


Glu 


625 










630 










635 






640 


Val 


Val 


Ser 


Gin 


Glu 
645 


Lvs 


Thr 


Met 


Phe 


Pro 
650 


Asn 


Leu 


Thr Asp Val 
655 


Arg 


Glu 


Val 


Val 


Thr 


Asd 


Gin 


Phe 


Leu 


Cys 


Ser 


Gly Thr 


Gin Glu Asp 


Glu 








660 










665 








670 




Ser 


Pro 


Cys 
675 


Lys 


Gly 


Glu 


Ser 


Gly 
680 


Gly 


Ala 


Val 


Phe 


Leu Glu Arg 
685 


Arg 


Phe 


Arg 


Phe 


Phe 


Gin 


Val 


Gly 


Leu 


Val 


Ser Trp Gly 


Leu Tyr Asn 


Pro 




690 










695 










700 






Cys 


Leu 


Gly Ser 


Ala 


Asp 


Lys 


Asn 


Ser 


Arg 


Lys 


Arg 


Ala Pro Arg 


Ser 


705 










710 










715 






720 


Lys 


Val 


Pro 


Pro 


Pro 


Arg 


Asp 


Phe 


His 


lie 


Asn 


Leu 


Phe Arg Met 


Gin 








725 










730 






735 




Pro 


Tip 


Leu Arg 


Gin 


His 


Leu 


Gly 


Asp 


Val 


Leu 


Asn 


Phe Leu Pro 


Leu 



740 745 750 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 
(Vi) ORIGINAL SOURCE: 





(xi) SEQUENCE 


DESCRIPTION; 


: SEQ ID 


NO: 46 : 








Met 


Ala 


Pro 


Leu 


Leu 


Ala 


Leu 


Phe 


Tyr 


Leu 


Leu 


Gin 


Leu Gly 


Pro 


Gly 


1 








5 










10 








15 




Leu 


Ala 


Ala 


Leu 


Phe 


Cys 


Asn 


Gin 


Asn 


Val 


Asn 


He 


Thr Gly Gly 


Asn 








20 










25 








30 






Phe 


Thr 


Leu 


Ser 


His 


Gly 


Trp 


Ala 


Pro 


Gly 


Ser 


Leu 


Leu He 


Tyr 


Ser 






35 










40 










45 






Cys 


Pro 


Leu 


Gly 


Arg 


Tyr 


Pro 


Ser 


Pro 


Ala 


Trp 


Arg 


Lys Cys 


Gin 


Ser 


50 










55 










60 








Asn 


Gly 


Gin 


Trp 


Leu 


Thr 


Pro 


Arg 


Ser 


Ser 


Ser 


His 


His Thr 


Leu 


Arg 


65 






70 










75 








80 


Ser 


Ser 


Arg 


Met 


Val 


Lys 


Ala 


Val 


Cys 


Lys 


Pro 


Val 


Arg Cys 


Leu 


Ala 








65 










90 








95 




Pro 


Ser 


Ser 


Phe 


Glu 


Asn 


Gly 


He 


Tyr 


Phe 


Pro 


Arg 


Leu Val 


Ser 


Tyr 








100 










105 








110 






Pro 


Val 


Gly 


Ser 


Asn 


Val 


Ser 


Phe 


Glu 


Cys 


Asp 


Glu 


Asp Phe 


Thr 


Leu 






115 










120 










125 






Arg 


Gly 


Ser 


Pro 


Val 


Arg 


Tyr 


Cys 


Arg 


Pro 


Asn 


Gly 


Leu Trp Asp 


Gly 


130 










135 










140 








Glu 


Thr 


Ala 


Val 


Cys 


Asd 


Asn 


Gly 


Ala 


Ser 


His 


Cys 


Pro Asn 


Pro 


Gly 


145 










150 










155 








160 


He 


Ser 


Val 


Gly 


Thr 


Ala 


Arg 


Thr 


Gly 


Leu 


Asn 


Phe 


Asp Leu Gly 


Asp 








165 










170 








175 




Lys 


Val 


Arg 


Tyr 


Arg 


Cys 


Ser 


Ser 


Ser 


Asn 


Met 


Val 


Leu Thr Gly 


Ser 




180 










185 








190 






Ala 


Glu 


Arg 


Glu 


Cys 


Gin 


Ser 


Asn 


Gly 


Val 


Tr 


Ser 


Gly Ser Glu 


Pro 






195 








200 










205 






He 


Cys 


Arg 


Gin 


Pro 


Tyr 


Ser 


Tyr 


Asp 


Phe 


Pro 


Glu 


Asp Val 


Ala 


Ser 




210 








215 










220 








Ala 


Leu 


Asp 


Thr 


Ser 


Leu 


Thr 


Asn 


Leu 


Leu 


Gly 


Ala 


Thr Asn 


Pro 


Thr 



225 230 235 240 
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Gin Asn Leu Leu Thr Lys Ser Leu Gly Arg Lys lie lie lie Gin Arg 

245 250 255 

Ser Gly His Leu Asn Leu Tyr Leu Leu Leu Asp Ala Ser Gin Ser Val 

260 265 270 

Thr Glu Lys Asp Phe Asp lie Phe Lys Lys Ser Ala Glu Leu Met Val 

275 280 285 

Glu Arg lie Phe Ser Phe Glu Val Asn Val Thr Val Ala He He Thr 

290 295 300 

Phe Ala Ser Gin Pro Lys Thr He Met Ser He Leu Ser Glu Arg Ser 
305 310 315 320 

Gin Asp Val Thr Glu Val He Thr Ser Leu Asp Ser Ala Ser Tyr Lys 

325 330 335 

Asp His Glu Asn Ala Thr Gly Ala Asn Thr Tyr Glu Val Leu He Arg 

340 345 350 

Val Tyr Ser Met Met Gin Thr Gin Met Asp Arg Leu Gly Met Glu Thr 

355 360 365 

Ser Ala Trp Lys Glu He Arg His Thr He He Leu Leu Thr Asp Gly 

370 375 380 

Lys Ser Asn Met Gly Asp Ser Pro Lys Lys Ala Val Thr Arg He Arg 
385 390 395 400 

Glu Leu Leu Ser He Glu Gin Asn Arg Asp Asp Tyr Leu Asp He Tyr 

405 410 ~ 415 

Ala He Gly Val Gly Lys Leu Asp Val Asp Trp Lys Glu Leu Asn Glu 

420 425 430 

Leu Gly Ser Lys Lys Asp Gly Glu Arg His Ala Phe He Leu Gin Asp 

435 440 445 

Ala Lys Ala Leu Gin Gin He Phe Glu His Met Leu Asp Val Ser Lys 

450 455 460 

Leu Thr Asp Thr He Cys Gly Val Gly Asn Met Ser Ala Asn Ala Ser 
465 470 475 480 

Asp Gin Glu Arg Thr Pro Trp Gin Val Thr Phe Lys Pro Lys Ser Lys 

465 490 495 

Glu Thr Cys Gin Gly Ser Leu He Ser Asp <3ln Trp Val Leu Thr Ala 

500 505 510 

Ala His Cys Phe His Asp He Gin Met Glu Asp His His Leu Trp Arg 

515 520 525 

Val Asn Val Gly Asp Pro Thr Ser Gin His Gly Lys Glu Phe Leu Val 

530 535 540 

Glu Asp Val He He Ala Pro Gly Phe Asn Val His Ala Lys Arg Lys 
545 550 555 560 

Gin Gly He Ser Glu Phe Tyr Ala Asp Asp He Ala Leu Leu Lys Leu 

565 570 575 

Ser Arg Lys Val Lys Met Ser Thr His Ala Arg Pro He Cys Leu Pro 

580 585 590 

Cys Thr Val Gly Ala Asn Met Ala Leu Arg Arg Ser Pro Gly Ser Thr 

595 600 605 

Cys Lys Asp His Glu Thr Glu Leu Leu Ser Gin Gin Lys Val Pro Ala 

610 615 620 

His Phe Val Ala Leu Asn Gly Asn Arg Leu Asn He Asn Leu Arg Thr 
625 630 635 640 

Gly Pro Glu Trp Thr Arg Cys He Gin Ala Val Ser Gin Asn Lys Asn 

645 650 655 

He Phe Pro Ser Leu Thr Asn Val Ser Glu Val Val Thr Asp Gin Phe 

660 665 670 

Leu Cys Ser Gly Met Glu Glu Glu Asp Asp Asn Pro Cys Lys Gly Glu 

675 680 685 

Ser Gly Gly Ala Val Phe Leu Gly Arg Arg Tyr Arg Phe Phe Gin Val 

690 695 700 

Gly Leu Val Ser Trp Gly Leu Phe Asp Pro Cys His Gly Ser Ser Asn 
705 710 715 720 

Lys Asn Leu Arg Lys Lys Pro Pro Arg Gly Val Leu Pro Arg Asp Phe 

725 730 735 

His He Ser Leu Phe Arg Leu Gin Pro Trp Leu Arg Gin His Leu Asp 

740 745 750 

Gly Val Leu Asp Phe Leu Pro Leu 
755 760 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 764 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Gly Ser Asn Leu Ser Pro Gin Leu Cvs Leu Met Pro Phe lie Leu 

15 lb 15 

Gly Leu Leu Ser Gly Gly Val Thr Thr Thr Pro Trp Ser Leu Ala Arg 

20 25 30 

Pro Gin Gly Ser Cys Ser Leu Glu Gly Val Glu lie Lys Gly Gly Ser 

35 40 45 

Phe Arg Leu Leu Gin Glu Gly Gin Ala Leu Glu Tyr Val Cys Pro Ser 

50 55 60 

Gly Phe Tyr Pro Tyr Pro Val Gin Thr Arg Thr Cys Arg Ser Thr Gly 
65 70 75 80 

Ser Trp Ser Thr Leu Lys Thr Gin Asp Gin Lys Thr Val Arg Lys Ala 

85 90 95 

Glu Cys Arg Ala lie His Cys Pro Arg Pro His Asp Phe Glu Asn Gly 

100 105 110 

Glu Tyr Trp Pro Arg Ser Pro Tyr Tyr Asn Val Ser Asp Glu lie Ser 

115 120 125 

Phe His Cys Tyr Asp Gly Tyr Thr Leu Arg Gly Ser Ala Asn Arg Thr 

130 135 140 

Cys Gin Val Asn Gly Arg Trp ser Gly Gin Thr Ala lie Cys Asp Asn 
145 150 155 160 

Gly Ala Gly Tyr Cys Ser Asn Pro Gly lie Pro lie Gly Thr Arg Lys 

165 170 175 

Val Gly Ser Gin Tyr Arg Leu Glu Asp Ser Val Thr Tyr His Cys Ser 

180 185 190 

Arg Gly Leu Thr Leu Arg Gly Ser Gin Arg Arg Thr Cys Gin Glu Gly 

195 200 205 

Gly Ser Trp Ser Gly Thr Glu Pro Ser Cys Gin Asp Ser Phe Met Tyr 

210 215 220 

Asp Thr Pro Gin Glu Val Ala Glu Ala Phe Leu Ser Ser Leu Thr Glu 
225 230 235 240 

Thr He Glu Gly Val Asp Ala Glu Asp Gly His Gly Pro Gly Glu Gin 

245 250 255 

Gin Lys Arg Lys He Val Leu Asp Pro Ser Gly Ser Met Asn He Tyr 

260 265 270 

Leu Val Leu Asp Gly Ser Asp Ser He Gly Ala Ser Asn Phe Thr Gly 

275 280 285 

Ala Lys Lys Cys Leu Val Asn Leu He Glu Lys Val Ala Ser Tyr Gly 

290 295 300 

Val Lys Pro Arg Tyr Gly Leu Val Thr Tyr Ala Thr Tyr Pro Lys He 
305 310 315 320 

Trp Val Lys Val Ser Glu Ala Asp Ser Ser Asn Ala Asp Trp Val Thr 

325 330 335 

Lys Gin Leu Asn Glu He Asn Tyr Glu Asp His Lys Leu Lys Ser Gly 

340 345 350 

Thr Asn Thr Lys Lys Ala Leu Gin Ala Val Tyr Ser Met Met Ser Trp 

355 360 365 

Pro Asd Asp Val Pro Pro Glu Gly Trp Asn Arg Thr Arg His Val He 

370 375 380 

He Leu Met Thr Asp Glv Leu His Asn Met Gly Gly Asp Pro He Thr 
385 390 39*5 " 400 

Val He Asp Glu He Arg Asp Leu Leu Tyr He Gly Lys Asp Arg Lys 

405 410 415 

Asn Pro Arg Glu Asp Tyr Leu Asp Val Tyr Val Phe Gly Val Gly Pro 

420 425 430 

Leu Val Asn Gin Val Asn He Asn Ala Leu Ala Ser Lys Lys Asp Asn 

435 440 445 

Glu Gin His Val Phe Lys Val Lys Asp Met Glu Asn Leu Glu Asp Val 
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450 455 460 

Phe Tyr Gin Met lie Asp Glu Ser Gin Ser Leu Ser Leu Cys Gly Met 
465 470 475 " 480 

Val Trp Glu His Arg Lys Gly Thr Asp Tyr His Lys Gin Pro Trp Gin 

485 490 495 

Ala Lys He Ser Val He Arg Pro Ser Lys Gly His Glu Ser Cys Met 

500 505 510 

Gly Ala Val Val Ser Glu Tyr Phe Val Leu Thr Ala Ala His Cys Phe 

515 520 525 

Thr Val Asp Asd Lys Glu His Ser He Lys Val Ser Val Gly Gly Glu 

530 535 540 

Lys Arg Asp Leu Glu He Glu Val Val Leu Phe His Pro Asn Tyr Asn 
545 550 555 560 

He Asn Gly Lys Lys Glu Ala Gly He Pro Glu Phe Tyr Asd Tvr Asp 

565 570 " 575 

Val Ala Leu He Lys Leu Lys Asn Lys Leu Lys Tvr Gly Gin Thr He 

580 585 * 590 

Arg Pro He Cys Leu Pro Cys Thr Glu Gly Thr Thr Arg Ala Leu Arg 

595 600 605 

Leu Pro Pro Thr Thr Thr Cys Gin Gin Gin Lys Glu Glu Leu Leu Pro 

610 615 620 

Ala Gin Asp He Lys Ala Leu Phe Val Ser Glu Glu Glu Lys Lys Leu 
625 630 635 640 

Thr Arg Lys Glu Val Tyr He Lys Asn Gly Asp Lys Lys Glv Ser Cys 

645 " 650 * 655 

Glu Arg Asp Ala Gin Tyr Ala Pro Gly Tyr Asp Lys Val Lvs Asp He 

660 665 670 

Ser Glu Val Val Thr Pro Arg Phe Leu Cys Thr Gly Gly Val Ser Pro 

675 680 665 

Tyr Ala Asp Pro Asn Thr Cvs Arg Gly Asp Ser Gly Gly Pro Leu He 

690 695 700 

Val His Lys Arg Ser Arg Phe He Gin Val Gly Val He Ser Trp Gly 
705 710 715 720 

Val Val Asp Val Cys Lys Asn Gin Lys Arg Gin Lys Gin Val Pro Ala 

725 730 735 

His Ala Arg Asp Phe His He Asn Leu Phe Gin Val Leu Pro Trp Leu 

740 745 750 

Lys Glu Lys Leu Gin Asp Glu Asp Leu Gly Phe Leu 
755 760 



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE : 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:48: 



Met Glu 


Ser 


Pro 


Gin 


Leu 


Cys 


Leu 


val 


Leu 


Leu 


Val Leu Gly Phe ser 


1 






5 










10 




15 


Ser Gly 


Gly 


Val 


Ser 


Ala 


Thr 


Pro 


Val 


Leu 


Glu 


Ala Arg Pro Gin Val 






20 










25 






30 


Ser Cys 


Ser 


Leu 


Glu 


Gly 


Val 


Glu 


He 


Lys 


Gly 


Gly Ser Phe Gin Leu 




35 










40 








45 


Leu Gin 


Gly 


Gly 


Gin 


Ala 


Leu 


Glu 


Tyr 


Leu 


Cys 


Pro Ser Gly Phe Tyr 


50 










55 










60 


Pro Tyr 


Pro 


Val 


Gin 


Thr 


Arg 


Thr 


Cys 


Arg 


Ser 


Thr Gly Ser Trp Ser 


65 








70 










75 


80 


Asp Leu 


Gin 


Thr 


Arg 


Asp 


Gin 


Lys 


He 


Val 


Gin 


Lys Ala Glu Cys Arg 








85 










90 




95 


Ala He 


Arg 


Cys 


Pro 


Arg 


Pro 


Gin 


Asd 


Phe 


Glu 


Asn Gly Glu Phe Trp 






100 










105 






110 
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Pro Arg Ser Pro Phe Tyr Asn Leu Ser Asp Gin lie Ser Phe Gin Cys 

115 120 125 

Tvr Asp Gly Tyr Val Leu Arg Gly Ser Ala Asn Arg Thr Cys Gin Glu 

13 5 140 
Asn Gly Arg Trp Asp Gly Gin Thr Ala lie Cys Asp Asp Gly Ala Gly 

« .r 150 

iyr cys Pro Asn Pro Gly He Pro He Gly Thr Arg Lys Val Gly Ser 

165 170 
Gin Tyr Arg Leu Glu Asp lie Val Thr Tyr His Cys Ser Arg Gly Leu 

Val Leu Arg Gly Ser Gin Lys Arg Lys Cys Gin Glu Gly Gly Ser Trp 

195 200 205 

ser Gly Thr Glu Pro Ser Cys Gin Asp Ser Phe Met Tyr Asp Ser Pro 

ota 215 220 

Gin Glu Val Ala Glu Ala Phe Leu Ser Ser Leu Thr Glu Thr He Glu 

230 

Gly Ala Asp Ala Glu Asp Gly His Ser Pro Gly Glu Gin Gin Lys Arg 

Lys lie Val Leu Asp Pro Ser Gly Ser Met Asn lie Tyr Leu Val Leu 

Asp Gly Ser Asp Ser lie Gly Ser Ser Asn Phe Thr Gly Ala Lys Arg 

275 280 285 

Cys Leu Thr Asn Leu lie Glu Lys Val Ala Ser Tyr Gly Val Arg Pro 

OQfi 295 300 

Arg Tyr Gly Leu Leu Thr Tyr Ala Thr Val Pro Lys Val Leu Val Arg 

310 315 
Val Ser Asp Glu Arg Ser Ser Asp Ala Asp Trp Val Thr Glu Lys Leu 

Asn Gin He Ser Tyr Glu Asp His Lys Leu Lys Ser Gly Thr Asn Thr 

340 ^45 J5U 

Lys Arg Ala Leu Gin Ala Val Tyr Ser Met Met Ser Trp Ala Gly Asp 
355 360 36b 



Ala Pro Pro Glu Gly Trp Asn Arg Thr Arg His Val He He He Met 

■*7ft 375 380 

Thr Asp Gly Leu His Asn Met Gly Gly Asn Pro Val Thr Val He Gin 

;pr 390 " 5 

Asp He Arg Ala Leu Leu Asp He Gly Arg Asp Pro Lys Asn Pro Arg 
405 

Glu Asp Tyr Leu Asp Val Tyr Val Phe Gly Val Gly Pro Leu Val Asp 

420 425 
Ser val Asn He Asn Ala Leu Ala Ser Lys Lys Asp Asn Glu His His 

Val Phe Lys Val Lys Asp Met llu Asp Leu Glu Asn Val Phe Tyr Gin 

Met He Asp Glu Thr Lys Ser Leu Ser Leu Cys Gly Met Val Trp Glu 

h" Lys Lys Gly Asn Asp Tyr His Lys Gin Pro Trp Gin Ala Lys He 

485 "0 " 3 

Ser Val Thr Arg Pro Leu Lys Gly His Glu Thr Cys Met Gly Ala Val 

500 505 aiv 

Val Ser Glu Tyr Phe Val Leu Thr Ala Ala His Cys Phe Met Val Asp 

ci c 520 5 " 

Asp Gin Lys His Ser He Lys Val Ser Val Gly Gly Gin Arg Arg Asp 

Leu llu He Glu Glu Val Leu Phe His Pro Lys Tyr Asn He Asn Gly 

cac 550 555 

Lys Lys Ala Glu Gly He Pro Glu Phe Tyr Asp Tyr Asp Val Ala Leu 

565 ^70 3/3 

Val Lys Leu Lys Asn Lys Leu Lys Tyr Gly Gin Thr Leu Arg Pro He 

Cys Leu Pro Cys Thr Glu Gly Thr Thr Arg Ala Leu Arg Leu Pro Gin 

Thr Ala Tnr Cys Lys Gin His Lys Glu Gin Leu Leu Pro Val Lys Asp 

6 15 ^ 
Val Lys Ala Leu Phe Val Ser Glu Gin Gly Lys ser Leu Thr Arg Lys 

III Val Tyr He Lys Asn Gly Asp Lys Lys Ala Ser Cys Glu Arg Asp 

Ala Thr Lys Ala Gin Gly Tyr Glu Lys val Lys Asp Ala Ser Glu Val 
660 665 

SUBSTITUTE SHEET (RULE 26) 



WO 97/01578 



PCT/US96/10958 



•115- 



Val 


Thr 


Pro 


Arg Phe Leu Cys Thr 






675 


680 


Pro 


Asn 


Thr 


Cys Lys Gly Asp Ser 




690 




695 


Arg 


Ser Arg 


Phe lie Gin Val Gly 


705 






710 


Val 


Cys 


Arg 


Asp Gin Arg Arg Gin 








725 


Asp 


Phe 


His 


lie Asn Leu Phe Gin 








740 


Leu 


Lys 


Asp 


Glu Asp Leu Gly Phe 






755 


760 



Gly Gly Val Asp Pro Tyr Ala Asp 
665 

Gly Gly Pro Leu He Val His Lys 
700 

Val He Ser Trp Gly Val Val Asp 
715 720 
Gin Leu Val Pro Ser Tyr Ala Arg 

730 735 
Val Leu Pro Tro Leu Lys Asp Lys 
745 ~ 750 

Leu 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 737 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Met Thr Ser Met Glu Cys Gly Leu Arg Leu Lys Trp Leu He Leu Ala 

15 io is 

Leu He Cys Pro Leu Thr Ala Gly Ala Pro Ser Arg Glu Gly Ser Cys 

20 25 30 

Pro Glu Glu Asn Leu Asp He Ala Gly Gly Ser Phe Thr Leu Ser Asn 

35 40 45 

Gly Tyr Ser Asp Gly Ser Tyr Leu Gin Tyr He Cys Pro Asp Asn His 

50 55 60 

Tyr Pro Ser lie Ser Ser Arg Arg Cys Gin Phe Gly Val Trp Thr Pro 

70 75 80 

Lys Ala Ser Ser Arg Lys Lys Ala Glu Cys Lys Lys He Thr Cys Pro 

85 90 95 

Asn Pro Arg Val Leu Glu Asn Gly Glu Val Ala Pro Tyr Gin Glu Arg 

100 105 no 

Tyr Tyr He Asn Asp Val Thr Thr Tyr Ser Cys Ser Ser Asp Tyr Lys 

H5 120 125 

Phe Arg Gly Ser Lys Val Arg Val Cys Gin Pro Asn Gly Lys Trp Asn 

130 135 140 

Gly Ser Thr Pro He Cys Gly Arg Asp Ser Asp His Cys Pro Asp Pro 
145 150 155 160 

Gly Val Pro Pro Gly Ser Ser Arg Thr Gly Ser He Phe Asn He Asp 

165 170 175 

Asp Glu Val Thr Tyr His Cys Asp Ser Pro Leu Thr Leu He Gly Ser 

1B0 185 190 

Lys val Arg Ser Val Trp Met Tyr Gly Gin Trp Ser Gly Thr Glu Pro 

195 200 205 

Gin Cys Tyr Ala Asp Phe Thr Tyr Asp Pro Ala Met Glu Ala Ala Glu 

210 215 220 

Ala Phe Gly Asn Ser Leu Thr Thr Thr Leu Thr Val Gin Gin Gly Phe 
225 230 235 240 

Glu Asp Asp Gin His Gly Lys Lys He Ser Leu Asp Arg Gly Gly Lys 

245 250 255 

Leu Asp lie Tyr He Ala Val Asp Ala Ser Asp Ser He Asp Pro Lys 

260 265 270 

Asp Phe Asp Lys Ala Lys Lys lie He Lys Thr Leu He Glu Lys He 

275 280 285 

Ser Tyr Tyr Glu Val Ser Pro Asn Tvr Glu lie Leu Met Phe Ala Thr 

290 295 300 

Asp Val Asp Gin lie Val Lys Met Arg Asp Phe Lvs Thr Asn Glu Lys 
305 310 315 * 320 

Ala Arg Lys lie Leu Lys lie Phe Glu Asp Leu Asp Asn Phe Asn Tyr 
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325 330 335 

Asp Lys Lys Gly Asp Arg Thr Gly Thr Asn lie Ala Lys Leu Tyr Leu 

340 345 350 

Lys lie Leu Asd Ser Met Ser Leu Glu Gin Val Gin Asn Lys Glu Asp 

355 360 365 

Phe Leu Gin Thr Gin His Val lie He Val Phe Thr Asp Gly Gin Ala 

370 375 380 

Asn Met Gly Gly Asn Pro Lys Pro Lys Val Asd Leu He Lys Asn Leu 
385 390 395 400 

Val He Lys Asn Asn Ala Ser Arg Glu Asn Lys Leu Asp Leu Tyr Val 

405 410 415 

Phe Gly Val Gly Lys Asp Val Lys Lys Glu Asp Met Asn Gly Leu Val 

420 425 430 

Ser Glu Lys Lys Asp Glu Arg His Phe Phe Lys Leu Pro Asp Leu Asp 

435 440 445 

Glu Val Gin Asn Thr Phe Asp Leu Met Leu Asp Asd Ser Thr Val Val 

450 455 " 460 

Gly Leu Cys Gly Met Gin Gin Asn Tyr Asp Gly Ser Asn Lys Arg Ser 
465 470 475 480 

Ala Tyr Pro Trp Leu Ala Gin Leu Ser He Ala Gin Ser Gin He Ser 

485 490 495 

Asp Cys Met Gly Ser Leu Val Thr Ser Arg Tyr He Leu Thr Ala Ala 

500 505 510 

His Cys Phe Lys Glu Gly Asp Thr Pro Asp Lys He Thr Val Tyr Leu 

515 520 525 

Glu Lys Asn Thr Asp Val Lys Val Glu Lys Val Phe He His Pro Asn 

530 535 540 

Tyr Ser Leu Thr Ala Lys Gin Ser lie Gly He Lys Glu Phe Tyr Asp 
545 550 555 560 

Phe Asp Val Ala Leu Leu Gin Leu Lys Thr Pro Val Lys Met Ser Val 

565 570 575 

Asn Leu Arg Pro He Cys Leu Pro Cys Thr Lys Glu Thr Asn Arg Ala 

S80 585 590 

Leu Lys Leu Ser Asp Ser Gin Gly Thr Cys Glu Lys His Glu Gin He 

595 600 605 

Leu Leu Ser Asn Glu Leu Val Asp Ala Ala Phe Thr Ser Lys Met Asp 

610 615 620 

Met Glu Lys Arg Ser Pro Arg Lys He Arg Arg He Thr Val Lys Leu 
625 630 635 640 

Gly Lys Tyr Leu Asp Ala Cys Val Glu Asp Ala Lys Lys Ala Lys Glu 

645 650 655 

Ser Lys Trp Gin Met Arg Arg Arg Gin Leu Gin Lys He Ser Cys Gly 

660 665 670 

Ser Gly Gly Asn Gin Pro Gin Arg Asp Asp Val Ser Cys Lys Gly Glu 

675 680 685 

Ser Gly Gly Ala Thr His Val Asp Lys Tyr Gly Arg Leu He Gin He 

690 695 700 

Gly Val Val Ser Trp Gly Val Lys Asn Leu Cys Ser Lys Lys Arg Asn 
705 710 715 720 

Leu Met Gin Phe Ser Val Ser Asp Ser Arg Asp Tyr His He Asn Pro 
725 730 735 

Phe 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oeptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 
(Vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
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Asp Ala Thr Met Ser He Leu Asp He Ser Met Met Thr Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Asp Ala Thr Met Ser He Leu Asp He Ser Met Met Thr Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:S2: 

Asp Gin Asp Ala Thr Met Ser He Leu Asp He Ser Met Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Ser He Leu Asp He Ser Met Met Thr Gly Phe Ala Pro Asp Thr 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Lys Ala Phe Ser Asp Arg Asn Thr Leu lie lie Tyr Leu Asp 
1 5* 10 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
( iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 55: 

Glu Val Val Ala Asp Ser Val Trp Val Asd Val Lys Asp Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE; 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:56: 

Ser Glu Phe Pro Glu Ser Trp Leu Trp Asn Val Glu Asp Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 57; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

Leu Ser Ser Asp Phe Trp Gly Glu Lys Pro Asn Leu Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 14 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Val Asn Phe Leu Leu Arg Met Asp Arg Ala His Glu Ala Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I SENSE : NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Ala Gin Gly Asp Val Pro Val Thr Val Thr Val His Asp 
15 10 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Ser Gly Gin Arg Glu Val Val Ala Asp Ser Val Trp Val Asp Val 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 
Thr lie Pro Ala Asn Arg Glu Phe Lys Ser Glu Lys Gly Arg 
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15 10 

(2) INFORMATION FOR SEQ ID NO: €2: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv> ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Ser lie Thr Val Arg Thr Lys Lys Gin Glu Leu Ser Glu Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Asp Leu Lys Glu Pro Pro Lys Asn Gly lie Ser Thr Lys Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

Gly Asd Gly Val Ala Lys Leu Ser He Asn Thr His Pro Ser 
1*5 10 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 



SUBSTITUTE SHEET (RULE 26) 



WO 97/01578 



PCT/US96/10958 



-121- 



(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 65: 

Glu Arg Leu Gly Arg Glu Glv Val Gin Lys Glu Asp lie 
15 10 

(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1-i ammo acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Tyr Tvr Thr Leu lie Gly Ala Ser Gly Gin Arg Glu Val Val 
1 \ 5 10 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Asp Gly Ser Pro Ala Tvr Arg Val Pro Val Ala Val Gin Gly Glu 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Arg Leu Leu Lys Ala Gly Arg Gin Val Arg Glu Pro Gly Gin 
15 10 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : peptide 
(iiij HYPOTHETICAL: NO 
(IV) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Pro Lys Ser Ser Leu Ser Val Pro Tyr Val lie Val Pro 
15 10 



(2) INFORMATION FOR SEQ ID NO: 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peDtide 

(iii) HYPOTHETICAL: NO* 
liv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Gin Val Asn Ser Leu Pro Gly Ser He Thr Lys Ala Gly Asp 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 71: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO:7i: 



Thr Val Leu Thr Pro Ala Thr Asn His Met Gly Asn Val Thr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 72: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Glu Val Gin Leu Val Ala His Ser Pro Trp Leu Lvs Asp Ser 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Thr Ser Asp Leu Asp Pro Ser Lys Ser Val Thr Arg Val Asp 
15 10 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide \ 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Thr Ala Glu Leu Val Ser Asp Ser Val Trp Leu Asn lie Glu 
X 5 10 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 75: 

Met Arg Leu Leu Gly Met Glu Thr Met Ala Trp Gin Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (RULE 26) 



WO 97/01578 



PCT/US96/10958 



•124- 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Arg Glu lie Leu Asn lie Asn Gin Lys Arg Asn Asd Tyr 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 77: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Trp Arg Val Asn Val Gly Asp Pro Lys Ser Gin Trp Gly Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 
(Vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Leu Lys Thr Ser lie Gly Asn Lys Pro Pro Glu Lys Leu Asp 
15 10 



(2) INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Cys Arg Leu Leu Lys Ala Gly Arg Gin Val Arg Glu Pro Gly Gin Cys 
15 10 15 
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I CLAIM: 

1. A method for identifying molecules that affect biological activity of a target protein, 
comprising the steps of: 

obtaining information regarding the location of an indel in an amino acid sequence of a 
5 target protein; 

obtaining a peptide fragment of said target protein, said peptide fragment having a 
sequence that is located in said amino acid sequence of said target protein within about 30 amino 
acid residues or less of said indel or obtaining a peptidomimetic or peptide analog of said peptide 
fragment; and 

1 0 screening said peptide fragment, said peptidomimetic, or said peptide analog for its affect 

on biological or biochemical activity of said target protein. 

2. The method of Claim 1, wherein said screening step comprises analyzing for modulation 
of protein activity, inhibition of protein activity, activation or potentiation of protein activity, competition 
for binding to a protein, binding to a protein or iigand, substitution for a substrate of said target protein, 

15 substitution for a iigand of said target protein, or making an anti-peptide antibody capable of modulating 

a biological activity of said target protein. 

3. The method of Claim 1, wherein said peptide fragment, said peptidomimetic, or said 
peptide analog directly affects said target protein. 

4. The method of Claim 1, wherein said peptide fragment, said peptidomimetic, or said 
20 peptide analog indirectly affects said target protein. 

5. The method of Claim 1, further comprising the step of synthetically constructing a 
peptide, peptide analog, or peptidomimetic that affects the biological or biochemical activity of said target 
protein. 

6. A method for making a pharmaceutical composition, comorising the steps of: 

25 obtaining a molecule identified as having biological or biochemical activity in accordance 

with the method of Claim 1; and 

combining said molecule with a pharmaceutically-acceptable carrier. 

7. The method of Claim 6, further comprising the step of packaging said molecule in unit- 
dosage form. 

30 8. The method of Claim 1, wherein said target protein is a protein of the mammalian 

complement system. 

9. The method of Claim 6. wherein said protein of the mammalian complement system is 
CZ C3, C4, C5 or Factor B. 
35 10. A method of identifying interface peptides for a target protein, comprising: 

identifying the location of an indel in an amino acid sequence of a target protein; 
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selecting an amino acid sequence from said target protein sequence overlapping or 
located within about 30 amino acid residues or less of an amino* or carboxykerminus of an indefc 

obtaining a molecule that is a peptide having said selected amino acid sequence, a 
5 peptide analog of said selected amino acid sequence, or a peptidomimetic of said selected ammo 

acid sequence; and 

evaluating said peptide, peptide analog or peptidomimetic in an assay to measure a 
change in activity of said target protein, wherein said change in activity is mediated directly or 
indirectly by said peptide, peptide analog or peptidomimetic. 
10 11. The method of Claim 1 or 10, wherein the peptide fragment has a sequence of about 

4 to about 20 amino acid residues, 10 to about 20 amino acid residues, about 4 to about 15 amino acid 
residues, or about 5 to about 18 amino acid residues in length. 

12. The method of Claim 1 or Claim 10, wherein the peptide fragment is located within 
about 20 amino acid residues, about 15 amino acid residues, about 12 amino acid residues, about 10 amino 

15 acid residues, about 8 amino acid residues or less of an indel. 

13. The method of Claim 1. wherein said peptide fragment has a sequence that spans said 

indel. 

14. The method of Claim 1, wherein said peptide fragment has a sequence located within 
said indel. 

20 15. The method of Claim 10, further comprising the step of making antibodies to said 

peptide, peptide analog or peptidomimetic, wherein said antibodies are capable of modulating an activity 
of said target protein. 

16. The method of Claim 10, wherein the change in activity of said target protein is a 
decrease in activity, an increase in activity, utilization of a substrate different than the substrate normally 

25 utilized by said target protein, or binding to a ligand differently than the ligand binding activity ordinarily 

demonstrated by said target protein. 

17. The method of Claim 10, wherein said target protein is a protein of the mammalian 
complement system including C2, C3, C4, C5 or Factor B. 

18. A peptide for modulating activity of the complement system of a mammal, comprising 
30 a sequence of about 4 to about 25 amino acid residues that occurs in an amino acid sequence of a 

mammalian complement protein, said peptide having an amino acid sequence in which an amino- or carboxyl* 
terminal residue is located within about 1 5 amino acid residues of an indel of said mammalian complement 
protein. 

19. The peptide of Claim IB, wherein said peptide modulates activity of the mammalian 
35 complement system by directly or indirectly inhibiting an activity of a protein of the mammalian complement 

system. 
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20. The peptide of Claim 18. wherein said peptide modulates activity of the mammalian 
complement system by directly or indirectly enhancing an activity of a protein of the mammalian 
complement system. 

21. The peptide of Claim 18, wherein said indel occurs within the amino acid sequence of 
5 a C2, C3, C4, C5 or Factor 8 protein. 

22. The peptide of Claim 18, further comprising another molecule attached to said peptide. 

23. An antibody that specifically recognizes a peptide according to Claim 18. 

24. A peptide analog or peptidomimetic molecule of a peptide according to Claim 18. 

25. A pharmaceutical composition comprising a peptide according to Claim 18. 

10 26. A pharmaceutical composition comprising a peptide analog or peptidomimetic molecule 

according to Claim 48. 

27. A pharmaceutical composition comprising an antibody according to Claim 47. 

28. A pharmaceutical composition comprising an antibody according to Claim 49. 

29. The peptide of Claim 18, wherein said peptide is located within about 10 amino acid 
15 residues of an indel of said mammalian complement protein. 

30. A peptide having an amino acid sequence of SEQ ID N0:53, SEQ 10 N0:54, SEQ ID 
NO:55, SEQ ID N0:56, SEQ ID N0:57, SEQ ID N0:58, SEQ ID N0:72 ( SEQ ID N0:73, SEQ ID N0:74 or SEQ 
ID N0:75. 
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SEQ. ID PROTIEN PEPTIDE 

NO: { 

1 humC3 VPVAVQGED***********TVQSLTOG*DG 

2 musC3 VLVVTQGS************NAKALTQD*DG 

3 humC4 IPVKVSATVSSP****GSVPEAQDIOQNTDG 

4 musC4 VPVKVSATLVS*****GSDSQVLDIQQSTNG 

5 humC5 VPVILNAQTIDVNQETSDLDPSKSVTRVDDG 

6 musC5 VPVTLMAQTVDVNQETSDLETKRSITHDTDG 
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